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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 



1. CROSS REFERENCE TO RELATED APPLICATIONS 

This application claims the priority benefit of U.S. Provisional Application Serial No. 
5 60/324,631 filed September 24, 2001 entitled "Novel Nucleic Acids and Polypeptides", 
Attorney Docket No. 810, which is a continuation-in-part application of PCT Application 
Serial No. PCT/US00/35017 filed December 22, 2000 entitled "Novel Contigs Obtained 
from Various Libraries", Attorney Docket No. 784CIP3 A'PCT, which in turn is a 
continuation-in-part application of U.S. Application Serial No. 09/552,317 filed April 25, 

10 2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 

784CEP, which in turn is a continuation-in-part application of U.S. Application Serial No. 
09/488,725 filed January 21, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 784; PCT Application Serial No. PCT/US0 1/02623 filed 
January 25, 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney 

1 5 Docket No. 785CIP3/PCT, which in turn is a continuation-in-part application of U.S. 

Application Serial No. 09/491,404 filed January 25, 2000 entitled "Novel Contigs Obtained 
from Various Libraries", Attorney Docket No. 785; PCT Application Serial No, 
PCT/US01/03800 filed February 5, 2001 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 787CIP3/PCT, which in turn is a continuation-in-part 

20 application of U.S. Application Serial No. 09/560,875 filed April 27, 2000 entitled "Novel 
Contigs Obtained from Various Libraries", Attorney Docket No. 787CIP, which in turn is a 
continuation-in-part application of U.S. Application Serial No. 09/496,914 filed February 03, 
2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 787; 
PCT Application Serial No. PCT/US 01/04927 filed February 26, 2001 entitled "Novel 

25 Contigs Obtained from Various Libraries", Attorney Docket No. 788CIP3/PCT, which in 
turn is a continuation-in-part application of U.S. Application Serial No. 09/577,409 filed 
May 18, 2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket 
No. 788CIP, which in turn is a continuation-in-part application of U.S. Application Serial 
No. 09/515,126 filed February 28, 2000 entitled "Novel Contigs Obtained from Various 

30 Libraries", Attorney Docket No. 788; PCT Application Serial No. PCT/US01/04941 filed 
March 5, 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket 
No. 789CIP3/PCT, which in turn is a continuation-in-part application of U.S. Application 
Serial No. 09/574,454 filed May 19, 2000 entitled "Novel Contigs Obtained from Various 
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Libraries", Attorney Docket No. 789CIP, which in turn is a continuation-in-part application 
of U.S. Application Serial No. 09/519,705 filed March 07, 2000 entitled 'Novel Contigs 
Obtained from Various Libraries", Attorney Docket No. 789; PCT Application Serial No. 
PCT/US0 1/08631 filed March 30, 2001 entitled "Novel Contigs Obtained from Various 
5 Libraries", Attorney Docket No. 790CIP3/PCT, which in turn is a continuation-in-part 

application of U.S. Application Serial No. 09/649,167 filed August 23, 2000 entitled "Novel 
Contigs Obtained from Various Libraries", Attorney Docket No. 790CIP, which in turn is a 
continuation-in-part application of U.S. Application Serial No. 09/540,217 filed March 31, 

2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 790; 
1 0 PCT Application Serial No. PCT/US01/08656 filed April 1 8, 2001 entitled "Novel Contigs 

Obtained from Various Libraries", Attorney Docket No. 791CIP3/PCT, which in turn is a 
continuation-in-part application of U.S. Application Serial No. 09/770,160 filed January 26, 

2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 
791CIP, which is in turn a continuation-in-part application of U.S. Application Serial No. 

1 5 09/5 52,929 filed April 18, 2000 entitled "Novel Contigs Obtained from Various Libraries", 
Attorney Docket No. 791; and PCT Application Serial No. PCT/US01/14827 filed Mayl6, 
2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 
792CJT3/PCT, which in turn is a continuation-in-part application of U.S. Application Serial 
No. 09/577,408 filed May 18, 2000 entitled "Novel Contigs Obtained from Various 

20 Libraries", Attorney Docket No. 792; all of which are incorporated herein by reference in 
their entirety. 

2. BACKGROUND OF THE INVENTION 

25 2.1 TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

30 2.2 BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such 
as lymphokines, interferons, circulating soluble factors, chemokines, and interleukins) has 
matured rapidly over the past decade. The now routine hybridization cloning and expression 
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cloning techniques clone novel polynucleotides "directly" in the sense that they rely on 
information directly related to the discovered protein (i.e., partial DNA/amino acid sequence 
of the protein in the case of hybridization cloning; activity of the protein in the case of 
expression cloning). More recent "indirect" cloning techniques such as signal sequence 
5 cloning, which isolates DNA sequences based on the presence of a now well-recognized 
secretory leader sequence motif, as well as various PCR-based or low stringency 
hybridization-based cloning techniques, have advanced the state of the art by making 
available large numbers of DNA/amino acid sequences for proteins that are known to have 
biological activity, for example, by virtue of their secreted nature in the case of leader 

10 sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 

techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, 
for example, diagnostics, forensics, gene mapping; identification of mutations responsible 
for genetic disorders or other traits, to assess biodiversity, and to produce many other types 

15 of data and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
20 cloned genes or degenerate variants thereof, especially naturally occurring variants such as 

allelic variants, antisense polynucleotide molecules, and antibodies that specifically recognize 
one or more epitopes present on such polypeptides, as well as hybridomas producing such 
antibodies. 

The compositions of the present invention additionally include vectors, including 
25 expression vectors, containing the polynucleotides of the invention, cells genetically engineered 
to contain such polynucleotides and cells genetically engineered to express such 
polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
30 hybridization (SBH), and in some cases, sequences obtained from one or more public 

databases. The invention relates also to the proteins encoded by such polynucleotides, along 
with therapeutic, diagnostic arid research utilities for these polynucleotides and proteins. These 
nucleic acid sequences are designated as SEQ ID NO: 1-971, or 1943-2709 and are provided in 
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the Sequence Listing. In the nucleic acids provided in the Sequence Listing, A is adenine; C is 
cytosine; G is guanine; T is thymine; and N is any of the four bases or unknown. In the amino 
acids provided in the Sequence Listing, * corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences 
5 that hybridize to the complement of SEQ ID NO: 1-971, or 1943-2709 under stringent 
hybridization conditions; nucleic acid sequences which are allelic variants or species 
homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences that 
encode a peptide comprising a specific domain or truncation of the peptides encoded by SEQ 
ED NO: 1-971, or 1943-2709. A polynucleotide comprising a nucleotide sequence having at 
10 least 90% identity to an identifying sequence of SEQ ID NO: 1-971, or 1943-2709 or a 

degenerate variant or fragment thereof. The identifying sequence can be 100 base pairs in 
length. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1-971, or 1943-2709. The 

15 sequence information can be a segment of anyoneof SEQ ID NO: 1-971, or 1943-2709 that 

uniquely identifies or represents the sequence information of SEQ ID NO: 1-971, or 1943-2709. 

A collection as used in this application can be a collection of only one polynucleotide. 
The collection of sequence information or identifying information of each sequence can be 
provided on a nucleic acid array. In one embodiment, segments of sequence information are 

20 provided on a nucleic acid array to detect the polynucleotide that contains the segment. The 
array can be designed to detect full-match or mismatch to the polynucleotide that contains the 
segment. The collection can also be provided in a computer-readable format. 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; 

25 and host cells or organisms transformed with these expression vectors. Nucleic acid sequences 
(or their reverse or direct complements) according to the invention have numerous applications 
in a variety of techniques known to those skilled in the art of molecular biology, such as use as 
hybridization probes, use as primers for PCR, use in an array, use in computer-readable media, 
use in sequencing full-length genes, use for chromosome and gene mapping, use in the 

3 0 recombinant production of protein, and use in the generation of anti-sense DNA or RNA, their 
chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-971, or 1943- 
2709 or novel segments or parts of the nucleic acids of the invention are used as primers in 
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expression assays that are well known in the art. In a particularly preferred embodiment, the 
nucleic acid sequences of SEQ ID NO: 1-971, or 1943-2709 or novel segments or parts of the 
nucleic acids provided herein are used in diagnostics for identifying expressed genes or, as well 
known in the art and exemplified by Vollrath et al„ Science 258:52-59 (1992), as expressed 
5 sequence tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-971, 
or 1943-2709; a polynucleotide comprising any of the full length protein coding sequences of 
SEQ ID NO: 1-971, or 1943-2709; and a polynucleotide comprising any of the nucleotide 

1 0 sequences of the mature protein coding sequences of SEQ ID NO: 1 -97 1 , or 1 943-2709. The 
polynucleotides of the present invention also include, but are not limited to, a polynucleotide 
that hybridizes under stringent hybridization conditions to (a) the complement of any one of the 
nucleotide sequences set forth in SEQ ID NO: 1-971, or 1943-2709; (b) a nucleotide sequence 
encoding any one of the amino acid sequences set forth in SEQ ID NO: 1-971, or 1943-2709; 

15 (c) a polynucleotide which is an allelic variant of any polynucleotides recited above; (d) a 
polynucleotide which encodes a species homologue (e.g. orthologs) of any of the proteins 
recited above; or '(e) a polynucleotide that encodes a polypeptide comprising a specific domain 
or truncation of any of the polypeptides comprising an amino acid sequence set forth in SEQ ID 
NO: 972-1942, or 2710-3476, or Tables 3, 4A, 4B, or 5. 

20 The isolated polypeptides of the invention include, but are not limited to, a polypeptide 

comprising any of the amino acid sequences set forth in the Sequence Listing; or the 
corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides with biological activity that are encoded by (a) any of the polynucleotides having 
a nucleotide sequence set forth in SEQ ID NO: 1-971, or 1943-2709; or (b) polynucleotides that 

25 hybridize to the complement of the polynucleotides of (a) under stringent hybridization 

conditions. Biologically active variants of any of the polypeptide sequences in the Sequence 
Listing, and "substantial equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 80%, 
85%, 90%>, 95%, 98% or 99% amino acid sequence identity) that preferably retain biological 
activity are also contemplated. The polypeptides of the invention may be wholly or partially 

3 0 chemically synthesized but are preferably produced by recombinant means using the genetically 
engineered cells (e.g. host cells) of the invention. 
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The invention also provides compositions comprising a polypeptide of the 
invention. Polypeptide compositions of the invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The invention also provides host cells transformed or transfected with a 
5 polynucleotide of the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
under conditions permitting expression of the desired polypeptide, and purifying the 
polypeptide from the culture or from the host cells. Preferred embodiments include those in 
10 which the protein produced by such processes is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology. These techniques 
include use as hybridization probes, use as oligomers, or primers, for PCR, use for 
chromosome and gene mapping, use in the recombinant production of protein, and use in 
15 generation of anti-sense DNA or RNA, their chemical analogs and the like. For example, 
when the expression of an mRNA is largely restricted to a particular cell or tissue type, 
polynucleotides of the invention can be used as hybridization probes to detect the presence 
of the particular cell or tissue mRNA in a sample using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
20 expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
procedures and methods that are currently applied to other proteins. For example, a 
25 polypeptide of the invention can be used to generate an antibody that specifically binds the 
polypeptide. Such antibodies, particularly monoclonal antibodies, are useful for detecting or 
quantitating the polypeptide in tissue. The polypeptides of the invention can also be used as 
molecular weight markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical 
30 condition which comprises the step of administering to a mammalian subject a 

therapeutically effective amount of a composition comprising a polypeptide of the present 
invention and a pharmaceutically acceptable carrier. 
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In particular, the polypeptides and polynucleotides of the invention can be utilized, 
for example, in methods for the prevention and/or treatment of disorders involving aberrant 
protein expression or biological activity. 

The present invention further relates to methods for detecting the presence of the 
5 polynucleotides or polypeptides of the invention in a sample. Such methods can, for 

example, be utilized as part of prognostic and diagnostic evaluation of disorders as recited 
herein and for the identification of subjects exhibiting a predisposition to such conditions. 
The invention provides a method for detecting the polynucleotides of the invention in a 
sample, comprising contacting the sample with a compound that binds to and forms a 

10 complex with the polynucleotide of interest for a period sufficient to form the complex and 
under conditions sufficient to form a complex and detecting the complex such that if a 
complex is detected, the polynucleotide of interest is detected. The invention also provides a 
method for detecting the polypeptides of the invention in a sample comprising contacting the 
sample with a compound that binds to and forms a complex with the polypeptide under 

15 conditions and for a period sufficient to form the complex and detecting the formation of the 
complex such that if a complex is formed, the polypeptide is detected. 

The invention also provides kits comprising polynucleotide probes and/or 
monoclonal antibodies, and optionally quantitative standards, for carrying out methods of the 
invention. Furthermore, the invention provides methods for evaluating the efficacy of drugs, 

20 and monitoring the progress of patients, involved in clinical trials for the treatment of 
disorders as recited above. 

The invention also provides methods for the identification of compounds that 
modulate (i.e., increase or decrease) the expression or activity of the polynucleotides and/or 
polypeptides of the invention. Such methods can be utilized, for example, for the 

25 identification of compounds that can ameliorate symptoms of disorders as recited herein. 
Such methods can include, but are not limited to, assays for identifying compounds and 
other substances that interact with (e.g., bind to) the polypeptides of the invention. The 
invention provides a method for identifying a compound that binds to the polypeptides of the 
invention comprising contacting the compound with a polypeptide of the invention in a cell 

30 for a time sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and detecting the complex by detecting 
the reporter gene sequence expression such that if expression of the reporter gene is detected 
the compound that binds to a polypeptide of the invention is identified. 
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The methods of the invention also provide methods for treatment which involve the 
administration of the polynucleotides or polypeptides of the invention to individuals 
exhibiting .symptoms or tendencies. In addition, the invention encompasses methods for 
treating diseases or disorders as recited herein comprising administering compounds and 
5 other substances that modulate the overall activity of the target gene products. Compounds 
and other substances can affect such modulation either on the level of target gene/protein 

expression or target protein activity. 

The polypeptides of the present invention and the polynucleotides encoding them are 
also useful for the same functions known to one of skill in the art as the polypeptides and 
10 polynucleotides to which they have homology (set forth in Tables 2A and 2B); for which 
they have a signature region (as set forth in Table 3); or for which they have homology to a 
gene family (as set forth in Tables 4A and 4B). If no homology is set forth for a sequence, 
then the polypeptides and polynucleotides of the present invention are useful for a variety of 
applications, as described herein, including use in arrays for detection. 

r 

15 

4. DETAILED DESCRIPTION OF THE INVENTION 



4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms 
20 "a", "an" and "the" include plural references unless the context clearly dictates otherwise. 

The term "active" refers to those forms of the polypeptide which retain the biologic 
and/or immunologic activities of any naturally occurring polypeptide. According to the 
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
25 Likewise "immunologically active" or "immunological activity" refers to the capability of 
the natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are 
engaged in extracellular or intracellular membrane trafficking, including the export of 
30 secretory or enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'~AGT-3' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded 
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molecules may be "partial" such that only certain portion(s) of the nucleic acids bind or it 
may be "complete" such that total complementarity exists between the single stranded 
molecules. The degree of complementarity between the nucleic acid strands has significant 
effects on the efficiency and strength of the hybridization between the nucleic acid strands. 
5 The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 

differentiated cell types in an embryo or an adult, including the germ cells. The term "germ 
line stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a 
steady and continuous source of germ cells for the production of gametes. The term 
"primordial germ cells (PGCs)" refers to a small population of cells set aside from other cell 

10 lineages particularly from the yolk sac, mesenteries, or gonadal ridges during embryogenesis 
that have the potential to differentiate into germ cells and other cells. PGCs are the source 
from which GSCs and ES cells are derived. The PGCs, the GSCs and the ES cells are 
capable of self-renewal. Thus these cells not only populate the germ line and give rise to a 
plurality of terminally differentiated cells that comprise the adult specialized organs, but are 

15 able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides 
which modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. 

20 EMFs include, but are not limited to, promoters, and promoter modulating sequences 
(inducible elements). One class of EMFs are nucleic acid fragments which induce the 
expression of an operably linked ORF in response to a specific regulatory factor or 
physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 

25 "oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or 
the sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the 
sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like 
material. In the sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and 

30 N is A, C, G, or T (U) or unknown. It is contemplated that where the polynucleotide is 
RNA, the T (thymine) in the sequences provided herein is substituted with U (uracil). 
Generally, nucleic acid segments provided by this invention may be assembled from 
fragments of the genome and short oligonucleotide linkers, or from a series of 
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oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid which 
is capable of being expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
5 "segment" or "probe" or "primer" are used interchangeably and refer to a sequence of 

nucleotide residues which are at least about 5 nucleotides, more preferably at least about 7 
nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 1 1 
nucleotides and most preferably at least about 17 nucleotides. The fragment is preferably 
less than about 500 nucleotides, preferably less than about 200 nucleotides, more preferably 
10 less than about 100 nucleotides, more preferably less than about 50 nucleotides and most 
preferably less than 30 nucleotides. Preferably the probe is from about 6 nucleotides to 
about 200 nucleotides, preferably from about 15 to about 50 nucleotides, more preferably 
from about 17 to 30 nucleotides and most preferably from about 20 to 25 nucleotides. 

* 

Preferably the fragments can be used in polymerase chain reaction (PGR), various 

15 hybridization procedures or microarray procedures to identify or amplify identical or related 
parts ofmRNA or DNA molecules. A fragment or segment may uniquely identify each 
polynucleotide sequence of the present invention. Preferably the fragment comprises a 
sequence substantially similar to any one of SEQ ID NO: 1-971, or 1943-2709. 

Probes may, for example, be used to determine whether specific mRNA molecules 

20 are present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal 
DNA as described by Walsh et al. (Walsh,P.S. et al., 1992, PGR Methods Appl 1:241-250). 
They may be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods 
well known in the art. Probes of the present invention, their preparation and/or labeling are 
elaborated in Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold 

25 Spring Harbor Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in 

Molecular Biology, John Wiley & Sons, New York NY, both of which are incorporated 
herein by reference in their entirety. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1-971, or 1943-2709. The 

30 sequence information can be a segment of any one of SEQ ID NO: 1-971, or 1943-2709 that 
uniquely identifies or represents the sequence information of that sequence of SEQ ID NO: 
1-971, or 1943-2709, or those segments identified in Tables 3, 4A, 4B, or 5. One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty- 
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mer is fully matched in the human genome is 1 in 300. In the human genome, there are 
three billion base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, 
there are 300 times more twenty-mers than there are base pairs in a set of human 
chromosomes. Using the same analysis, the probability for a seventeen-mer to be folly 
5 matched in the human genome is approximately 1 in 5. When these segments are used in 
arrays for expression studies, fifteen-mer segments can be used. The probability that the 
fifteen-mer is fully matched in the expressed sequences is also approximately one in five 
because expressed sequences comprise less than approximately 5% of the entire genome 
sequence. 

10 Similarly, when using sequence information for detecting a single mismatch, a segment 

can be a twenty-five mer. The probability that the twenty-five mer would appear in a human 
genome with a single mismatch is calculated by multiplying the probability for a foil match 
(1h-4 25 ) times the increased probability for mismatch at each nucleotide position (3 x 25). The 
probability that an eighteen mer with a single mismatch can be detected in an array for 

1 5 expression studies is approximately one in five. The probability that a twenty-mer with a single 
mismatch can be detected in a human genome is approximately one in five. 

The term "open reading frame," ORP, means a series of nucleotide triplets coding for 
amino acids without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related 

20 nucleic acid sequences. For example, a promoter is operably associated or operably linked 
with a coding sequence if the promoter controls the transcription of the coding sequence. 
While operably linked nucleic acid sequences can be contiguous and in the same reading 
frame, certain genetic elements e.g. repressor genes are not contiguously linked to the coding 
sequence but still control transcription/translation of the coding sequence. 

25 The term "pluripotent" refers to the capability of a cell to differentiate into a number 

of differentiated cell types that are present in an adult organism. A pluripotent cell is 
restricted in its differentiation capability in comparison to a totipotent cell. 

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an 
oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to naturally 

30 occurring or synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a 
stretch of amino acid residues of at least about 5 amino acids, preferably at least about 7 
amino acids, more preferably at least about 9 amino acids and most preferably at least about 
17 or more amino acids. The peptide preferably is not greater than about 200 amino acids, 
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more preferably less than 150 amino acids and most preferably less than 100 amino acids. 
Preferably the peptide is from about 5 to about 200 amino acids. To be active, any 
polypeptide must have sufficient length to display biological and/or immunological activity. 
The term "naturally occurring polypeptide" refers to polypeptides produced by cells 
5 that have not been genetically engineered and specifically contemplates various polypeptides 
arising from post-translational modifications of the polypeptide including, but not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence which encodes for the 
full-length protein which may include any leader sequence or any processing sequence. 
10 The term "mature protein coding sequence" means a sequence which encodes a 

* 

peptide or protein without a signal or leader sequence. The "mature protein portion" means 
that portion of the protein which does not include a signal or leader sequence. The peptide 
may have been produced by processing in the cell which removes any leader/signal 
sequence. The mature protein portion may or may not include the initial methionine residue. 

1 5 The methionine residue may be removed from the protein during processing in the cell. The 
peptide may be produced synthetically or the protein may have been produced using a 
polynucleotide only encoding for the mature protein coding sequence. 

The term "derivative" refers to polypeptides chemically modified by such techniques 
as ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer 

20 attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 

substitution by chemical synthesis of amino acids such as ornithine, which do not normally 

occur in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, 

25 eg., recombinant DNA techniques. Guidance in determining which amino acid residues 

may be replaced, added or deleted without abolishing activities of interest, may be found by 
comparing the sequence of the particular polypeptide with that of homologous peptides and 
minimizing the number of amino acid sequence changes made in regions of high homology 
(conserved regions) or by replacing amino acids with consensus sequence. 

30 Alternatively, recombinant variants encoding these same or similar polypeptides may 

be synthesized or selected by making use of the "redundancy" in the genetic code. Various 
codon substitutions, such as the silent changes which produce various restriction sites, may 
be introduced to optimize cloning into a plasmid or viral vector or expression in a particular 
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prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be 
reflected in the polypeptide or domains of other peptides added to the polypeptide to modify 
the properties of any part of the polypeptide, to change characteristics such as ligand-binding 
affinities, interchain affinities, or degradation/turnover rate. 
5 Preferably, amino acid "substitutions" are the result of replacing one amino acid with 

another amino acid having similar structural and/or chemical properties, i.e., conservative 
amino acid replacements. "Conservative" amino acid substitutions maybe made on the 
basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues involved. For example, nonpolar (hydrophobic) amino 

1 0 acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, 
asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, 
and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic 
acid. "Insertions" or "deletions" are preferably in the range of about 1 to 20 amino acids, 

15 more preferably 1 to 10 amino acids. The variation allowed may be experimentally 

determined by systematically making insertions, deletions, or substitutions of amino acids in 
a polypeptide molecule using recombinant DNA techniques and assaying the resulting 
recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 

20 non-conservative alterations can be engineered to produce altered polypeptides. Such 

alterations can, for example, alter one or more of the biological functions or biochemical 
characteristics of the polypeptides of the invention. For example, such alterations may 
change polypeptide characteristics such as ligand-binding affinities, interchain affinities, or 
degradation/turnover rate. Further, such alterations can be selected so as to generate 

25 polypeptides that are better suited for expression, scale up and the like in the host cells 
chosen for expression. For example, cysteine residues can be deleted or substituted with 
another amino acid residue in order to eliminate disulfide bridges. 

The terms "purified" or "substantially purified" as used herein denotes that the 
indicated nucleic acid or polypeptide is present in the substantial absence of other biological 

30 macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, 
more preferably at least 99% by weight, of the indicated biological macromolecules present 
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(but water, buffers, and other small molecules, especially molecules having a molecular 
weight of less than 1000 daltons, can be present). 

The term "isolated 0 as used herein refers to a nucleic acid or polypeptide separated 
from at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic 
5 acid or polypeptide in its natural source, hi one embodiment, the nucleic acid or polypeptide 
is found in the presence of (if anything) only a solvent, buffer, ion, or other component 
normally present in a solution of the same. The terms "isolated" and "purified" do not 
encompass nucleic acids or polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 

10 that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or 

mammalian) expression systems. "Microbial" refers to recombinant polypeptides or proteins 
made in bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant 
microbial" defines a polypeptide or protein essentially free of native endogenous substances 
and unaccompanied by associated native glycosylation. Polypeptides or proteins expressed 

1 5 in most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; 

polypeptides or proteins expressed in yeast will have a glycosylation pattern in general 
different from those expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or 
virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression 

20 vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element 
or element's having a regulatory role in gene expression, for example, promoters or 
enhancers, (2) a structural or coding sequence which is transcribed into mRNA and 
translated into protein, and (3) appropriate transcription initiation and termination sequences. 
Structural units intended for use in yeast or eukaryotic expression systems preferably include 

25 a leader sequence enabling extracellular secretion of translated protein by a host cell. 
Alternatively, where recombinant protein is expressed without a leader or transport 
sequence, it may include an amino terminal methionine residue. This residue may or may 
not be subsequently cleaved from the expressed recombinant protein to provide a final 
product. 

30 The term "recombinant expression system" means host cells which have stably 

integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit extrachromosomally. Recombinant expression systems as 
defined herein will express heterologous polypeptides or proteins upon induction of the 
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regulatory elements linked to the DNA segment or synthetic gene to be expressed. This 
term also means host cells which have stably integrated a recombinant genetic element or 
elements having a regulatory role in gene expression, for example, promoters or enhancers. 
Recombinant expression systems as defined herein will express polypeptides or proteins 
5 endogenous to the cell upon induction of the regulatory elements linked to the endogenous 
DNA segment or gene to be expressed. The cells can be prokaryotic or eukaryotic. 

The term "secreted" includes a protein that is transported across or through a 
membrane, including transport as a result of signal sequences in its amino acid sequence 
when it is expressed in a suitable host cell. "Secreted" proteins include without limitation 

10 proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in 
which they are expressed. "Secreted" proteins also include without limitation proteins that 
are transported across the membrane of the endoplasmic reticulum. "Secreted" proteins are 
also intended to include proteins containing non-typical signal sequences (e.g. Interleukin- 1 
Beta, see Krasney, P.A. and Young, P.R. (1992) Cytokine 4(2): 134 -143) and factors 

15 released from damaged cells (e.g. Interleukin- 1 Receptor Antagonist, see Arend, W.P. et. al. 
(1998) Annu. Rev. Immunol. 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a 
sequence may be naturally present on the polypeptides of the present invention or provided 

20 from heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in 
the art as stringent. Stringent conditions can include highly stringent conditions (i.e., 
hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 
mM EDTA at 65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent 

25 conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization 
conditions are described herein in the examples. 

In instances of hybridization of deoxyoligonucleotides, additional exemplary 
stringent hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate 
at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligonucleotides), 55°C (for 20- 

30 base oligonucleotides), and 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" or "substantially similar" can refer both to 
nucleotide and amino acid sequences, for example a mutant sequence, that varies from a 
reference sequence by one or more substitutions, deletions, or additions, the net effect of 
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which does not result in an adverse functional dissimilarity between the reference and 
subject sequences. Typically, such a substantially equivalent sequence varies from one of 
those listed herein by no more than about 35% {i.e., the number of individual residue 
substitutions, additions, and/or deletions in a substantially equivalent sequence, as compared 
5 to the corresponding reference sequence, divided by the total number of residues in the 
substantially equivalent sequence is about 0.35 or less). Such a sequence is said to have 
65% sequence identity to the listed sequence. In one embodiment, a substantially 
equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no more 
than 30% (70% sequence identity); in a variation of this embodiment, by no more than 25% 

10 (75% sequence identity); and in a further variation of this embodiment, by no more than 

20% (80% sequence identity) and in a further variation of this embodiment, by no more than 
10% (90% sequence identity) and in a further variation of this embodiment, by no more that 
5% (95% sequence identity). Substantially equivalent, e.g., mutant, amino acid sequences 
according to the invention preferably have at least 80% sequence identity with a listed amino 

1 5 acid sequence, more preferably at least 85% sequence identity, more preferably at least 90% 
sequence identity, more preferably at least 95% sequence identity, more preferably at least 
98% sequence identity, and most preferably at least 99% sequence identity. Substantially 
equivalent nucleotide sequence of the invention can have lower percent sequence identities, 
taking into account, for example, the redundancy or degeneracy of the genetic code. 

20 Preferably, the nucleotide sequence has at least about 65% identity, more preferably at least 
about 75% identity, more preferably at least about 80% sequence identity, more preferably at 
least 85% sequence identity, more preferably at least 90% sequence identity, more preferably 
at least about 95% sequence identity, more preferably at least 98% sequence identity, and 
most preferably at least 99% sequence identity. For the purposes of the present invention, 

25 sequences having substantially equivalent biological activity and substantially equivalent 
expression characteristics are considered substantially equivalent. For the purposes of 
determining equivalence, truncation of the mature sequence {e.g., via a mutation which 
creates a new stop codon) should be disregarded. Sequence identity may be determined, 
e.g., using the Jotun Hein method (Hein, J. (1 990) Methods Enzymol. 1 83 :626-645). 

30 Identity between sequences can also be determined by other methods known in the art, e.g. 
by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of the 

cell types of an adult organism. 
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The term "transformation" means introducing DNA into a suitable host cell so that 
the DNA is replicable, either as an extrachromosomal element, or by chromosomal 
integration. The term "transfection" refers to the taking up of an expression vector by a 
suitable host cell, whether or not any coding sequences are in fact expressed. The term 
5 "infection" refers to the introduction of nucleic acids into a suitable host cell by use of a 

virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of 
nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can be 
readily identified using known UMFs as a target sequence or target motif with the 

10 computer-based systems described below. The presence and activity of a UMF can be 

confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic acid 
molecule is then incubated with an appropriate host under appropriate conditions and the 
uptake of the marker sequence is determined. As described above, a UMF will increase the 
frequency of uptake of a linked marker sequence. 

15 Each of the above terms is meant to encompass all that is described for each, unless 

the context dictates otherwise. 



4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

20 The isolated polynucleotides of the invention include a polynucleotide comprising 

the nucleotide sequences of SEQ ID NO: 1-971, or 1943-2709; a polynucleotide encoding 
any one of the peptide sequences of SEQ ID NO: 1-971, or 1943-2709; and a polynucleotide 
comprising the nucleotide sequence encoding the mature protein coding sequence of the 
polynucleotides of any one of SEQ ID NO: 1-971, or 1943-2709. The polynucleotides of the 

25 present invention also include, but are not limited to, a polynucleotide that hybridizes under 
stringent conditions to (a) the complement of any of the nucleotides sequences of SEQ ID 
NO: 1-971, or 1943-2709; (b) nucleotide sequences encoding any one of the amino acid 
sequences set forth in the Sequence Listing; (c) a polynucleotide which is an allelic variant 
of any polynucleotide recited above; (d) a polynucleotide which encodes a species 

30 homologue of any of the proteins recited above; or (e) a polynucleotide that encodes a 

polypeptide comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 
972-1942, or 2710-3476 (for example, as set forth in Tables 3, 4A, 4B, or 5). Domains of 
interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like 
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polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic 
domains, or combinations thereof; domains in immunoglobulin-like proteins include the 
variable immunoglobulin-like domains; domains in enzyme-like polypeptides include 
catalytic and substrate binding domains; and domains in ligand polypeptides include 

5 receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or 
partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The 
polynucleotides may include entire coding region of the cDNA or may represent a portion of 

the coding region of the cDNA. 

10 The present invention also provides genes corresponding to the cDNA sequences 

disclosed herein. The corcesponding genes can be isolated in accordance with known methods 
using the sequence information disclosed herein. Such methods include the preparation of 
probes or primers from the disclosed sequence infonnation for identification and/or 
amplification of genes in appropriate genomic libraries or other sources of genomic materials. 

1 5 Further 5' and 3' sequence can be obtained using methods known in the art. For example, full 
length cDNA or genomic DNA that corresponds to any of the polynucleotides of SEQ ID NO: 
1-971, or 1943-2709 can be obtained by screening appropriate cDNA or genomic DNA 
libraries under suitable hybridization conditions using any of the polynucleotides of SEQ ID 
NO: 1-971, or 1943-2709 or a portion thereof as a probe. Alternatively, the polynucleotides of 

20 SEQ ID NO: 1-971, or 1943-2709 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and sequences 
(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence 

25 information, representative fragment or segment information, or novel segment information for 

the full-length gene. 

The polynucleotides of the invention also provide polynucleotides including 
nucleotide sequences that are substantially equivalent to the polynucleotides recited above. 
Polynucleotides according to the invention can have, e.g., at least about 65%, at least about 
30 70%, at least about 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least 
about 85%, 86%>, 87%, 88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, 
and even more typically at least about 95%, 96%, 97%, 98%, 99% sequence identity to a 
polynucleotide recited above. 
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Included within the scope of the nucleic acid sequences of the invention are nucleic 
acid sequence fragments that hybridize under stringent conditions to any of the nucleotide 
sequences of SEQ ID NO: 1-971, or 1943-2709, or complements thereof, which fragment is 
greater than about 5 nucleotides, preferably 7 nucleotides, more preferably greater than 9 
5 nucleotides and most preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 
nucleotides or more that are selective for (i.e. specifically hybridize to) any one of the 
polynucleotides of the invention are contemplated. Probes capable of specifically 
hybridizing to a polynucleotide can differentiate polynucleotide sequences of the invention 
from other polynucleotide sequences in the same family of genes or can differentiate human 

1 0 genes from genes of other species, and are preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1- 
971, or 1943-2709, a representative fragment thereof, or a nucleotide sequence at least 90% 

15 identical, preferably 95% identical, to SEQ ID NO: 1-971, or 1943-2709 with a sequence from 
another isolate of the same species. Furthermore, to accommodate codon variability, the 
invention includes nucleic acid molecules coding for the same amino acid sequences as do the 
specific ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of 
one codon for another codon that encodes the same amino acid is expressly contemplated. 

20 The nearest neighbor or homology results for the nucleic acids of the present invention, 

including SEQ ID NO: 1-971, or 1943-2709 can be obtained by searching a database using an 
algorithm or a program. Preferably, a BLAST (Basic Local Alignment Search Tool) program is 
used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) and 
Altschul S.F. et al. J. Mol. Biol. 21:403-410 (1990)). Alternatively a FASTA version 3 search 

25 against Genpept, using FASTXY algorithm may be performed. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are 
also provided by the present invention. Species homologs may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a 
suitable nucleic acid source from the desired species. 

30 The invention also encompasses allelic variants of the disclosed polynucleotides or 

proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which 
also encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 
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The nucleic acid sequences of the invention are further directed to sequences which 
encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 
native or variant polynucleotide. There are two variables in the construction of amino acid 
5 sequence variants: the location of the mutation and the nature of the mutation. Nucleic 

acids encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These 
nucleic acid alterations can be made at sites that differ in the nucleic acids from different 
species (variable positions) or in highly conserved regions (constant regions). Sites at such 

10 locations will typically be modified in series, e.g., by substituting first with conservative 

choices (e.g., hydrophobic amino acid to a different hydrophobic amino acid) and then with 
more distant choices (e.g., hydrophobic amino acid to a charged amino acid), and then 
deletions or insertions may be made at the target site. Amino acid sequence deletions 
generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are 

15 typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal 

fusions ranging in length from one to one hundred or more residues, as well as intrasequence 
insertions of single or multiple amino acid residues. Intrasequence insertions may range 
generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of 
terminal insertions include the heterologous signal sequences necessary for secretion or for 

20 intracellular targeting in different host cells and sequences such as FLAG or poly-histidine 
sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter 
a polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 

25 nucleotides on both sides of the changed amino acid to form a stable duplex on either side of 
the site of being changed. In general, the techniques of site-directed mutagenesis are well 
known to those of skill in the art and this technique is exemplified by publications such as, 
Edelman et al., DNA 2:183 (1983). A versatile and efficient method for producing 
site-specific changes in a polynucleotide sequence was published by Zoller and Smith, 

30 Nucleic Acids Res. 10:6487-6500 (1982). PGR may also be used to create amino acid 

sequence variants of the novel nucleic acids. When small amounts of template DNA are 
used as starting material, primer(s) that differs slightly in sequence from the corresponding 
region in the template DNA can generate the desired amino acid variant. PCR amplification 
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results in a population of product DNA fragments that differ from the polynucleotide 
template encoding the polypeptide at the position specified by the primer. The product DNA 
fragments replace the corresponding region in the plasmid and this gives a polynucleotide 
encoding the desired amino acid variant. 
5 A further technique for generating amino acid variants is the cassette mutagenesis 

technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques 
well known in the art, such as, for example, the techniques in Sambrook et al., supra, and 
Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a 
1 0 functionally equivalent amino acid sequence may be used in the practice of the invention for 
the cloning and expression of these novel nucleic acids. Such DNA sequences include those 
which are capable of hybridizing to the appropriate novel nucleic acid sequence under 
stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention could be 

1 5 used to generate polynucleotides encoding chimeric or fusion proteins comprising one or 
more domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of 
the polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, 
amplified, or synthetic) or RNA. Methods and algorithms for obtaining such 

20 polynucleotides are well known to those of skill in the art and can include, for example, 

methods for determining hybridization conditions that can routinely isolate polynucleotides 
of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 
protein coding sequences corresponding to any one of SEQ ID NO: 1-971, or 1943-2709, or 

25 functional equivalents thereof, may be used to generate recombinant DNA molecules that 
direct the expression of that nucleic acid, or a functional equivalent thereof, in appropriate 
host cells. Also included are the cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et 

30 al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). 
Useful nucleotide sequences for joining to polynucleotides include an assortment of vectors, 
e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well 
known in the art. Accordingly, the invention also provides a vector including a 
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polynucleotide of the invention and a host cell containing the polynucleotide. In general, 
the vector contains an origin of replication functional in at least one organism, convenient 
restriction endonuclease sites, and a selectable marker for the host cell. Vectors according to 
the invention include expression vectors, replication vectors, probe generation vectors, and 
5 sequencing vectors. A host cell according to the invention can be a prokaryotic or 
eukaryotic cell and can be a unicellular organism or part of a multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic 
acid having any of the nucleotide sequences of SEQ ID NO: 1-971, or 1943-2709 or a 
fragment thereof or any other polynucleotides of the invention. In one embodiment, the 

10 recombinant constructs of the present invention comprise a vector, such as a plasmid or viral 
vector, into which a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1- 
971, or 1943-2709 or a fragment thereof is inserted, in a forward or reverse orientation. In 
the case of a vector comprising one of the ORFs of the present invention, the vector may 
further comprise regulatory sequences, including for example, a promoter, operably linked to 

1 5 the ORF. Large numbers of suitable vectors and promoters are known to those of skill in the 
art and are commercially available for generating the recombinant constructs of the present 
invention. The following vectors are provided by way of example: Bacterial: pBs, 
phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a 
(Stratagene), P Trc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); Eukaryotic: 

20 pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al., 
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. 

25 Many suitable expression control sequences are known in the art. General methods of 
expressing recombinant proteins are also known and are exemplified in R. Kaufman, 
Methods in Enzymology 185, 537-566 (1990). As defined herein "operably linked" means 
that the isolated polynucleotide of the invention and an expression control sequence are 
situated within a vector or cell in such a way that the protein is expressed by a host cell 

30 which has been transformed (transfected) with the ligated polynucleotide/expression control 
sequence. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
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appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the 
5 level of ordinary skill in the art. Generally, recombinant expression vectors will include 
origins of replication and selectable markers permitting transformation of the host cell, e.g., 
the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived 
from a highly expressed gene to direct transcription of a downstream structural sequence. 
Such promoters can be derived from operons encoding glycolytic enzymes such as 3- 
10 phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat shock proteins, among 
others. The heterologous structural sequence is assembled in appropriate phase with 
translation initiation and termination sequences, and preferably, a leader sequence capable of 
directing secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an amino 
1 5 terminal identification peptide imparting desired characteristics, e.g. , stabilization or 

simplified purification of expressed recombinant product. Useful expression vectors for 
bacterial use are constructed by inserting a structural DNA sequence encoding a desired 
protein together with suitable translation initiation and termination signals in operable 
reading phase with a functional promoter. The vector will comprise one or more phenotypic 
20 selectable markers and an origin of replication to ensure maintenance of the vector and to, if 
desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may 
also be employed as a matter of choice. 
25 As a representative but non-limiting example, useful expression vectors for bacterial 

use can comprise a selectable marker and bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of the well known cloning 
vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 
(Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, 
30 USA). These pBR322 "backbone" sections are combined with an appropriate promoter and 
the structural sequence to be expressed. Following transformation of a suitable host strain 
and growth of the host strain to an appropriate cell density, the selected promoter is induced 
or derepressed by appropriate means (e.g. , temperature shift or chemical induction) and cells 
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are cultured for an additional period. Cells are typically harvested by centrifiigation, 
disrupted by physical or chemical means, and the resulting crude extract retained for further 
purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
5 example, as described in Fan et al., Nat. Biotech 17, 870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 
against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intra-muscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form 
10 of naked DNA. 



4.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 

15 nucleotide sequence of SEQ ID NO: 1-971, or 1943-2709, or fragments, analogs or 

derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that is 
complementary to a "sense" nucleic acid encoding a protein, e.g., complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA 
sequence. .In specific aspects, antisense nucleic acid molecules are provided that comprise a 

20 sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an 
entire coding strand, or to only a portion thereof. Nucleic acid molecules encoding 
fragments, homologs, derivatives and analogs of a protein of any of SEQ ID NO: 1-971, or 
1943-2709 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ ID 
NO: 1-971, or 1943-2709 are additionally provided. 

* 

25 In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 

region" of the coding strand of a nucleotide sequence of the invention. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 

30 of the invention. The term "noncoding region" refers to 5' and 3' sequences that flank the 
coding region that are not translated into amino acids {i.e., also referred to as 5' and 3' 
untranslated regions). 
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Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., 
SEQ ID NO: 1-971, or 1943-2709, antisense nucleic acids of the invention can be designed 
according to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of an mRNA, but more 
5 preferably is an oligonucleotide that is antisense to only a portion of the coding or noncoding 
region of an mRNA. For example, the antisense oligonucleotide can be complementary to 
the region surrounding the translation start site of an mRNA. An antisense oligonucleotide 
can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An 
antisense nucleic acid of the invention can be constructed using chemical synthesis or 

10 enzymatic ligation reactions using procedures known in the art. For example, an antisense 
nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized using 
naturally occurring nucleotides or variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and 

1 5 acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5 -(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 

20 dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 

1- methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3- 
methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 

methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 
5 f -methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, 
25 uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5 -methyl- 

2- thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced 
biologically using an expression vector into which a nucleic acid has been subcloned in an 

30 antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described farther in the following 
subsection). 
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The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of 
the protein, e.g. , by inhibiting transcription and/or translation. The hybridization can be by 
5 conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An example of a route of 
administration of antisense nucleic acid molecules of the invention includes direct injection 
at a tissue site. Alternatively, antisense nucleic acid molecules can be modified to target 

10 selected cells and then administered systemically. For example, for systemic administration, 
antisense molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to 
peptides or antibodies that bind to cell surface receptors or antigens. The antisense nucleic 
acid molecules can also be delivered to cells using the vectors described herein. To achieve 

15 sufficient intracellular concentrations of antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong p : ol II or pol III 

* 

promoter are preferred. 

hi yet another embodiment, the antisense nucleic acid molecule of the invention is an 

a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
20 double-stranded hybrids with complementary RNA in which, contrary to the usual oc-units, 
the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15: 
6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue et al (1987) Nucleic Acids Res 15: 6131-6148) or a 
chimeric RNA -DNA analogue (Inoue et al (1987) FEBSLett 215: 327-330). 

25 

4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
30 complementary region. Thus, ribozymes {e.g., hammerhead ribozymes (described in 
Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
mRNA transcripts to thereby inhibit translation of an mRNA. A ribozyme having specificity 
for a nucleic acid of the invention can be designed based upon the nucleotide sequence of a 
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DNA disclosed herein (i.e., SEQ ID NO: 1-971, or 1943-2709). For example, a derivative 
of Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence to be cleaved in a mRNA. See, e.g., 
Cech et al U.S. Pat. No. 4,987,071; and Cech et al U.S. Pat. No. 5,116,742. Alternatively, 
5 mRNA of the invention can be used to select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel et al, (1993) Science 
261:1411-1418. 

Alternatively, gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region {e.g., promoter and/or enhancers) to form triple 

10 helical structures that prevent transcription of the gene in target cells. See generally, Helene. 
(1991) Anticancer Drug Des. 6: 569-84; Helene. et al (1992) Ann. N.Y. Acad. Sci. 
660:27-36; and Maher ( 1992) Bioassays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the 
base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 

15 hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 

backbone of the nucleic acids can be modified to generate peptide nucleic acids (see Hyrup 
et al (1996) BioorgMed Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" 
or "PNAs" refer to nucleic acid mimics, e.g. , DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 

20 nucleobases are retained. The neutral backbone of PNAs has been shown to allow for 
specific hybridization to DNA and RNA under conditions of low ionic strength. The 
synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis 
protocols as described in Hyrup et al (1996) above; Perry-O'Keefe et al (1996) PNAS 93: 
14670-675. 

25 PNAs of the invention can be used in therapeutic and diagnostic applications. For 

example, PNAs can be used as antisense or antigene agents for sequence-specific modulation 
of gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs of the invention can also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PGR clamping; as artificial restriction enzymes 

30 when used in combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); 
or as probes or primers for DNA sequence and hybridization (Hyrup et al. (1996), above; 
Perry-O'Keefe (1996), above). 
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In another embodiment, PNAs of the invention can be modified, e.g., to enhance 
their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
5 combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of 
base stacking, number of bonds between the nucleobases, and orientation (Hyrup (1996) 

1 0 above). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup 

(1996) above and Finn et al (1996) Nucl Acids Res 24: 3357-63. For example, a DNA chain 
can be synthesized on a solid support using standard phosphor amidite coupling chemistry, 
and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5 , -deoxy-thymidine 
phosphoramidite, can be used between the PNA and the 5' end of DNA (Mag et al. (1989) 

1 5 Nucl Acid Res 17: 5973-88). PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al 
(1996) above). Alternatively, chimeric molecules can be synthesized with a 5' DNA 
segment and a 3' PNA segment. See, Petersen et al (1975) Bioorg Med Chem Lett 5: 
1119-11124. 

20 In other embodiments, the oligonucleotide may include other appended groups such 

as peptides {e.g. , for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad. Set U.S.A. 
86:6553-6556; Lemaitre et al, 1987, Proc. Natl Acad. Sci. 84:648-652; PCT Publication 
No. W088/09810) or the blood-brain barrier (see, e.g., PCT Publication No. W089/10134). 

25 In addition, oligonucleotides can be modified with hybridization triggered cleavage agents 
(See, e.g., Krol et al, 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., 
Zon, 1988, Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to 
another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport 
agent, a hybridization-triggered cleavage agent, etc. 



4.5 HOSTS 

The present invention further provides host cells genetically engineered to contain 
the polynucleotides of the invention. For example, such host cells may contain nucleic acids 
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of the invention introduced into the host cell using known transformation, transfection or 
infection methods. The present invention still further provides host cells genetically 
engineered to express the polynucleotides of the invention, wherein such polynucleotides are 
in operative association with a regulatory sequence heterologous to the host cell which 
5 drives expression of the polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by 
homologous recombination) to provide increased polypeptide expression by replacing, in 
whole or in part, the naturally occurring promoter with all or part of a heterologous promoter 

10 so that the cells express the polypeptide at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to the encoding sequences. See, for 
example, PCT International Publication No. WO94/12650, PCT International Publication 
No. WO92/20808, and PCT International Publication No. WO91/09955. It is also 
contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA 

15 (e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate 
synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be 
inserted along with the heterologous promoter DNA. If linked to the coding sequence, 
amplification of the marker DNA by standard selection methods results in co-amplification 
of the desired protein coding sequences in the cells. 

20 The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 

eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation 
(Davis, L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one 

25 of the polynucleotides of the invention, can be used in conventional manners to produce the 
gene product encoded by the isolated fragment (in the case of an ORF) or can be used to 
produce a heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the 
present invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, 

30 Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and 
B. subtilis. The most preferred cells are those which do not normally express the particular 
polypeptide or protein or which expresses the polypeptide or protein at low natural level. 
Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under 
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the control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present 
invention. Appropriate cloning and expression vectors for use with prokaryotic and 
eukaryotic hosts are described by Sambrook, et al., in Molecular Cloning: A Laboratory 
5 Manual, Second Edition, Cold Spring Harbor, New York (1989), the disclosure of which is 
hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 lines 
of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines 

10 capable of expressing a compatible vector are, for example, the C127, monkey COS cells, 
Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A43 1 cells, 
human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal 
diploid cells, cell strains derived from in vitro culture of primary tissue, primary explants, 
HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. Mammalian expression 

15 vectors will comprise an origin of replication, a suitable promoter and also any necessary 
ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA sequences derived 
from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, 
and polyadenylation sites may be used to provide the required nontranscribed genetic 

20 elements. Recombinant polypeptides and proteins produced in bacterial culture are usually 
isolated by initial extraction from cell pellets, followed by one or more salting-out, aqueous 
ion exchange or size exclusion chromatography steps. Protein refolding steps can be used, 
as necessary, in completing configuration of the mature protein. Finally, high performance 
liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells 

25 employed in expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such as 
yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, 

30 or any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
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phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered 
5 to express an endogenous gene comprising the polynucleotides of the invention under the 
control of inducible regulatory elements, in which case the regulatory sequences of the 
endogenous gene may be replaced by homologous recombination. As described herein, gene 
targeting can be used to replace a gene's existing regulatory region with a regulatory 
sequence isolated from a different gene or a novel regulatory sequence synthesized by 

1 0 genetic engineering methods. Such regulatory sequences may be comprised of promoters, 
enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional 
initiation sites, and regulatory protein binding sites or combinations of said sequences. 
Alternatively, sequences which affect the structure or stability of the RNA or protein 
produced may be replaced, removed, added, or otherwise modified by targeting. These 

1 5 sequence include polyadenylation signals, mRNA stability elements, splice sites, leader 
sequences for enhancing or modifying transport or secretion properties of the protein, or 
other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
20 gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 
element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different 
25 cell-type specificity than the naturally occurring elements. Here, the naturally occurring 
sequences are deleted and new sequences are added. In all cases, the identification of the 
targeting event may be facilitated by the use of one or more selectable marker genes that are 
contiguous with the targeting DNA, allowing for the selection of cells in which the 
exogenous DNA has integrated into the host cell genome. The identification of the targeting 
30 event may also be facilitated by the use of one or more marker genes exhibiting the property 
of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting 
sequence, and such that a correct homologous recombination event with sequences in the 
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host cell genome does not result in the stable integration of the negatively selectable 
marker. Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase 
(TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance 
5 with this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 
to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by 
reference herein in its entirety. 

10 

4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a 
polypeptide comprising: the amino acid sequences set forth as any one of SEQ ID NO: 972- 
1942, or 2710-3476 or an amino acid sequence encoded by any one of the nucleotide 

15 sequences SEQ ID NO: 1-971, or 1943-2709 or the corresponding full length or mature 

protein. Polypeptides of the invention also include polypeptides preferably with biological or 
immunological activity that are encoded by: (a) a polynucleotide having any one of the 
nucleotide sequences set forth in SEQ ID NO: 1-971, or 1943-2709 or (b) polynucleotides 
encoding any one of the amino acid sequences set forth as SEQ ID NO: 972-1942, or 2710- 

20 3476 or (c) polynucleotides that hybridize to the complement of the polynucleotides of either 
(a) or (b) under stringent hybridization conditions. The invention also provides biologically 
active or immunologically active variants of any of the amino acid sequences set forth as 
SEQ ID NO: 972-1942, or 2710-3476 or the corresponding full length or mature protein; and 
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least 

25 about 75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 

90%>, 91%, 92%, 93%, 94%), typically at least about 95%, 96%, 97%, more typically at least 
about 98%), or most typically at least about 99% amino acid identity) that retain biological 
activity. Polypeptides encoded by allelic variants may have a similar, increased, or 
decreased activity compared to polypeptides comprising SEQ ID NO: 972-1942, or 2710- 

30 3476. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein 
may be in linear form or they may be cyclized using known methods, for example, as 
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described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. 
McDowell, et al., J. Amer. Chem. Soc. 1 14, 9245-9253 (1992), both of which are 
incorporated herein by reference. Such fragments may be fused to carrier molecules such as 
immunoglobulins for many purposes, including increasing the valency of protein binding 
5 sites. Fragments are also identified in Tables 3, 4 A, 4B, or 5. 

The present invention also provides both full-length and mature forms (for example, 
without a signal sequence or precursor sequence) of the disclosed proteins. The protein 
coding sequence is identified in the sequence listing by translation of the disclosed 
nucleotide sequences. The predicted signal sequence is set forth in Table 6. The mature 

1 0 form of such protein may be obtained and confirmed by expression of a full-length 

polynucleotide in a suitable mammalian cell or other host cell and sequencing of the cleaved 
product. One of skill in the art will recognize that the actual cleavage site may be different 
than that predicted in Table 6. The sequence of the mature form of the protein is also 
determinable from the amino acid sequence of the full-length form. Where proteins of the 

15 present invention are membrane bound, soluble forms of the proteins are also provided. In 
such forms, part or all of the regions causing the proteins to be membrane bound are deleted 
so that the proteins are fully secreted from the cell in which they are expressed (See, e.g., 
Sakal et al., Prep. Biochem. Biotechnol. (2000), 30(2), pp. 107-23, incorporated herein by 
reference). 

20 Protein compositions of the present invention may further comprise an acceptable 

carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic 
acid fragments of the present invention or by degenerate variants of the nucleic acid 
fragments of the present invention. By "degenerate variant" is intended nucleotide 

25 fragments which differ from a nucleic acid fragment of the present invention (e.g. , an ORF) 

* 

by nucleotide sequence but, due to the degeneracy of the genetic code, encode an identical 
polypeptide sequence. Preferred nucleic acid fragments of the present invention are the 
ORFs that encode proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of the 
30 isolated polypeptides or proteins of the present invention. At the simplest level, the amino 
acid sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or 
tertiary structural and/or conformational characteristics with proteins may possess biological 
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properties in common therewith, including protein activity. This technique is particularly 
useful in producing small peptides and fragments of larger polypeptides. Fragments are 
useful, for example, in generating antibodies against the native polypeptide. Thus, they may 
be employed as biologically active or immunological substitutes for natural, purified 
5 proteins in screening of therapeutic compounds and in immunological processes for the 
development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified 
from cells which have been altered to express the desired polypeptide or protein. As used 
herein, a cell is said to be altered to express a desired polypeptide or protein when the cell, 

10 through genetic manipulation, is made to produce a polypeptide or protein which it normally 
does not produce or which the cell normally produces at a lower level. One skilled in the art 
can readily adapt procedures for introducing and expressing either recombinant or synthetic 
sequences into eukaryotic or prokaryotic cells in order to generate a cell which produces one 
of the polypeptides or proteins of the present invention. 

1 5 The invention also relates to methods for producing a polypeptide comprising 

growing a culture of host cells of the invention in a suitable culture medium, and purifying 
the protein from the cells or the culture in which the cells are grown. For example, the 
methods of the invention include a process for producing a polypeptide in which a host cell 
containing a suitable expression vector that includes a polynucleotide of the invention is 

20 cultured under conditions that allow expression of the encoded polypeptide. The 

polypeptide can be recovered from the culture, conveniently from the culture medium, or 
from a lysate prepared from the host cells and further purified. Preferred embodiments 
include those in which the protein produced by such process is a full length or mature form 
of the protein. 

25 hi an alternative method, the polypeptide or protein is purified from bacterial cells 

which naturally produce the polypeptide or protein. One skilled in the art can readily follow 
known methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange 

30 chromatography, and immuno-affinity chromatography. See, e.g., Scopes, Protein 
Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., in 
Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in Molecular 
Biology. Polypeptide fragments that retain biological/immunological activity include 



WO 03/029271 PCT/US02/30474 

35 

fragments comprising greater than about 100 amino acids, or greater than about 200 amino 
acids, and fragments that encode specific protein domains. 

The purified polypeptides can be used in in vitro binding assays which are well 

4 

known in the art to identify molecules which bind to the polypeptides. These molecules 
5 include but are not limited to, for e.g., small molecules, molecules from combinatorial 

libraries, antibodies or other proteins. The molecules identified in the binding assay are then 
tested for antagonist or agonist activity in in vivo tissue culture or animal models that are 
well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or 
animals and then tested for either cell/animal death or prolonged survival of the animal/cells. 

10 In addition, the peptides of the invention or molecules capable of binding to the 

peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds that 
are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor or other 
cell by the specificity of the binding molecule for SEQ ID NO: 972-1942, or 2710-3476. 
The protein of the invention may also be expressed as a product of transgenic 

15 animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are 
characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modification are naturally 

20 provided or deliberately engineered. For example, modifications, in the peptide or DNA 

sequence, can be made by those skilled in the art using known techniques. Modifications of 
interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For example, 
one or more of the cysteine residues may be deleted or replaced with another amino acid to 

25 alter the conformation of the molecule. Techniques for such alteration, substitution, 

replacement, insertion or deletion are well known to those skilled in the art (see, e.g., U.S. 
Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, insertion or 
deletion retains the desired activity of the protein. Regions of the protein that are important 
for the protein function can be determined by various methods known in the art including the 

30 alanine-scanning method which involved systematic substitution of single or strings of 
amino acids with alanine, followed by testing the resulting alanine-containing variant for 
biological activity. This type of analysis determines the importance of the substituted amino 
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acid(s) in biological activity. Regions of the protein that are important for protein function 
may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and are useful for screening or other 
5 immunological methodologies may also be easily made by those skilled in the art given the 
disclosures herein. Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of 
the invention to suitable control sequences in one or more insect expression vectors, and 
employing an insect expression system. Materials and methods for baculovirus/insect cell 

10 expression systems are commercially available in kit form from, e.g., frivitrogen, San Diego, 
Calif., U.S.A. (the MaxBat™ kit), and such methods are well known in the art, as described 
in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), 
incorporated herein by reference. As used herein, an insect cell capable of expressing a 
polynucleotide of the present invention is "transformed." 

1 5 The protein of the invention may be prepared by culturing transformed host cells 

under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or cell 
extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 

20 containing agents which will bind to the protein; one or more column steps over such affinity 
resins as concanavalin A-agarose, heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; 
one or more steps involving hydrophobic interaction chromatography using such resins as 
phenyl ether, butyl ether, or propyl ether; or immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which will 

25 facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
maltose binding protein (MBP), glutathione-S -transferase (GST) or thioredoxin (TRX), or as 
a His tag. Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N J.) and 
Invitrogen, respectively. The protein can also be tagged with an epitope and subsequently 

30 purified by using a specific antibody directed to such epitope. One such epitope ("FLAG®") 
is commercially available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
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methyl or other aliphatic groups, can be employed to further purify the protein. Some or all 
of the foregoing purification steps, in various combinations, can also be employed to provide 
a substantially homogeneous isolated recombinant protein. The protein thus purified is 
substantially free of other mammalian proteins and is defined in accordance with the present 

5 invention as an "isolated protein." 

The polypeptides of the invention include analogs (variants). This embraces 
fragments, as well as peptides in which one or more amino acids has been deleted, inserted, 
or substituted. Also, analogs of the polypeptides of the invention embrace fusions of the 
polypeptides or modifications of the polypeptides of the invention, wherein the polypeptide 

1 0 or analog is fused to another moiety or moieties, e.g., targeting moiety or another therapeutic 
agent. Such analogs may exhibit improved properties such as activity and/or stability. 
Examples of moieties which may be fused to the polypeptide or an analog include, for 
example, targeting moieties which provide for the delivery of polypeptide to pancreatic cells, 
e.g., antibodies to pancreatic cells, antibodies to immune cells such as T-cells, monocytes, 

15 dendritic cells, granulocytes, etc., as well as receptor and ligands expressed on pancreatic or 
immune cells. Other moieties which may be fused to the polypeptide include therapeutic 
agents which are used for treatment, for example, immunosuppressive drugs such as 
cyclosporin, SK506, azathioprine, CD 3 antibodies and steroids. Also, polypeptides maybe 
fused to immune modulators, and other cytokines such as alpha or beta interferon. 

20 

4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE 
IDENTITY AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between 
the sequences tested. Methods to determine identity and similarity are codified in computer 

25 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, 
S.F. et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic 
Acids Res. vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu 

30 et al., J. Comp. Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif 
software (Nevill-Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by 
reference), Pfam software (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 
(1998), herein incorporated by reference) and the Kyte-Doolittle hydrophobocity prediction 
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algorithm (J. Mol Biol, 157, pp. 105-31 (1982), the GeneAtlas software (Molecular 
Simulations Inc. (MSI), San Diego, CA) (Sanchez and Sali (1998) Proc. Natl. Acad. ScL, 95, 
13597-13602; Kitson DH et al, (2000) "Remote homology detection using structural 
modeling - an evaluation 55 Submitted; Fischer and Eisenberg (1996) Protein Sci. 5, 947- 
5 955), Neural Network SignalP VI . 1 program (from Center for Biological Sequence 
Analysis, The Technical University of Denmark) incorporated herein by reference). 
Polypeptide sequences were examined by a proprietary algorithm, SeqLoc that separates the 
proteins into three sets of locales: intracellular, membrane, or secreted. This prediction is 
based upon three characteristics of each polypeptide, including percentage of cysteine 

1 0 residues, Kyte-Doolittle scores for the first 20 amino acids of each protein, and Kyte- 

Doolittle scores to calculate the longest hydrophobic stretch of the said protein. Values of 
predicted proteins are compared against the values from a set of 592 proteins of known 
cellular localization from the Swissprot database ( http ://www.expasv.ch/sprot ). Predictions 
are based upon the maximum likelihood estimation. 

1 5 Presence of transmembrane region(s) was detected using the TMpred program 

(http://wwwxh.embnet.org/software/TMPRED form.html ). 

The BLAST programs are publicly available from the National Center for 
Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S., et al. 
NCBI NLM NTH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 

20 (1990). 

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 
another polypeptide. Within a fusion protein the polypeptide according to the invention can 

25 correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 
invention. In another embodiment, a fusion protein comprises at least two biologically 
active portions of a protein according to the invention. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the polypeptide according to the invention 

30 and the other polypeptide are fused in-frame to each other. The polypeptide can be fused to 
the N-terminus or C-terminus, or to the middle. 

For example, in one embodiment a fusion protein comprises a polypeptide according 
to the invention operably linked to the extracellular domain of a second protein. 
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In another embodiment, the fusion protein is a GST-fusion protein in which the 
polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., 
glutathione S -transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in 
5 which the polypeptide sequences according to the invention comprise one or more domains 
fused to sequences derived from a member of the immuno globulin protein family. The 
immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in 

1 0 vivo. The immunoglobulin fusion proteins can be used to affect the bioavailability of a 

cognate ligand. Inhibition of the ligand/protein interaction may be useful therapeutically for 
both the treatment of proliferative and differentiative disorders, e.g., cancer as well as 
modulating (e.g., promoting or inhibiting) cell survival. Moreover, the immunoglobulin 
fusion proteins of the invention can be used as immunogens to produce antibodies in a 

15 subject, to purify ligands, and in screening assays to identify molecules that inhibit the 
interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 

20 techniques, e.g. , by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as 
appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PGR amplification of 

25 gene fragments can be carried out using anchor primers that give rise to complementary 

overhangs between two consecutive gene fragments that can subsequently be annealed and 
reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. (eds.) 
Current Protocols in Molecular Biology, John Wiley & Sons, 1992). Moreover, 
many expression vectors are commercially available that already encode a fusion moiety 

30 (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the invention can be 
cloned into such an expression vector such that the fusion moiety is linked in-frame to the 
protein of the invention. 



WO 03/029271 PCT/US02/30474 

40 

4.8 GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of normal 
function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides 
5 of the invention. Delivery of a functional gene encoding polypeptides of the invention to 
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more 
particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo 
by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for 
example, Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For 

10 additional reviews of gene therapy technology see Friedmann, Science, 244: 1275-1281 

(1989); Verma, Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). 
Introduction of any one of the nucleotides of the present invention or a gene encoding the 
polypeptides of the present invention can also be accomplished with extrachromosomal 
substrates (transient expression) or artificial chromosomes (stable expression). Cells may 

1 5 also be cultured ex vivo in the presence of proteins of the present invention in order to 

proliferate or to produce a desired effect on or activity in such cells. Treated cells can then 
be introduced in vivo for therapeutic purposes. Alternatively, it is contemplated that in other 
human disease states, preventing the expression of or inhibiting the activity of polypeptides 
of the invention will be useful in treating the disease states. It is contemplated that antisense 

20 therapy or gene therapy could be applied to negatively regulate the expression of 
polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 
molecules to the nucleic acids of the present invention, their complements, or their translated 
RNA sequences, by methods known in the art. Further, the polypeptides of the present 

25 invention can be inhibited by using targeted deletion methods, or the insertion of a negative 
regulatory element such as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to 
express the polynucleotides of the invention, wherein such polynucleotides are in operative 
association with a regulatory sequence heterologous to the host cell which drives expression of 

30 the polynucleotides in the cell. These methods can be used to increase or decrease the 
expression of the polynucleotides of the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of 
cells to pennit, increase, or decrease, expression of endogenous polypeptide. Cells can be 
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modified (e.g., by homologous recombination) to provide increased polypeptide expression by 
replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous 
promoter so that the cells express the protein at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to the desired protein encoding sequences. 
5 See, for example, PCT International Publication No. WO 94/12650, PCT International 

Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955. It is also 
contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., 
ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, 
aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with 

1 0 the heterologous promoter DNA. If linked to the desired protein coding sequence, 

amplification of the marker DNA by standard selection methods results in co-amplification of 
the desired protein coding sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control 

1 5 of inducible regulatory elements, in which case the regulatory sequences of the endogenous 

gene may be replaced by homologous recombination. As described herein, gene targeting can 
be used to replace a gene's existing regulatory region with a regulatory sequence isolated from 
a different gene or a novel regulatory sequence synthesized by genetic engineering methods. 
Such regulatory sequences maybe comprised of promoters, enhancers, scaffold-attachment 

20 regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding 
sites or combinations of said sequences. Alternatively, sequences which affect the structure or 
stability of the RNA or protein produced may be replaced, removed, added, or otherwise 
modified by targeting. These sequences include polyadenylation signals, mRNA stability 
elements, splice sites, leader sequences for enhancing or modifying transport or secretion 

25 properties of the protein, or other sequences which alter or improve the function or stability of 

protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
30 deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 
element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type 
specificity than the naturally occurring elements. Here, the naturally occurring sequences are 
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deleted and new sequences are added. In all cases, the identification of the targeting event 
may be facilitated by the use of one or more selectable marker genes that are contiguous with 
the targeting DNA, allowing for the selection of cells in which the exogenous DNA has 
integrated into the cell genome. The identification of the targeting event may also be facilitated 
5 by the use of one or more marker genes exhibiting the property of negative selection, such that 
the negatively selectable marker is linked to the exogenous DNA, but configured such that the 
negatively selectable marker flanks the targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the 

10 Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 

1 5 PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by 
reference herein in its entirety. 



4.9 TRANSGENIC ANIMALS 

20 In preferred methods to determine biological functions of the polypeptides of the 

invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 

25 Animals in which an endogenous gene has been inactivated by homologous recombination 
are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, 
can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 
Transgenic animals are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 

30 systems to identify compounds that modulate lipid metabolism. Transgenic animals, 

preferably non-human mammals, are produced using methods as described in U.S. Patent No 
5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 
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Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of 
expression of the polypeptides of the invention. Inactivation can be carried out using 
homologous recombination methods described above. Activation can be achieved by 
5 supplementing or even replacing the homologous promoter to provide for increased protein 
expression. The homologous promoter can be supplemented by insertion of one or more 
heterologous enhancer elements known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
through, e.g., homologous recombination or knock out strategies, of animals that fail to 

10 express polypeptides of the invention or that express a variant polypeptide. Such animals are 
useful as models for studying the in vivo activities of polypeptide as well as for studying 
modulators of the polypeptides of the invention. 

ha preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 

1 5 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination 
are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, 

20 can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 
Transgenic animals are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 
systems to identify compounds that modulate lipid metabolism. Transgenic animals, 
preferably non-human mammals, are produced using methods as described in U.S. Patent No 

25 5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or 

30 even replacing the homologous promoter to provide for increased protein expression. The 
homologous promoter can be supplemented by insertion of one or more heterologous 
enhancer elements known to confer promoter activation in a particular tissue. 
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4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one 
or more of the uses or biological activities (including those associated with assays cited 
herein) identified herein. Uses or activities described for proteins of the present invention 
5 may be provided by administration or use of such proteins or of polynucleotides encoding 
such proteins (such as, for example, in gene therapies or vectors suitable for introduction of 
DNA). The mechanism underlying the particular condition or pathology will dictate whether 
the polypeptides of the invention, the polynucleotides of the invention or modulators 
(activators or inhibitors) thereof would be beneficial to the subject in need of treatment. 

10 Thus, "therapeutic compositions of the invention" include compositions comprising isolated 
polynucleotides (including recombinant DNA molecules, cloned genes and degenerate 
variants thereof) or polypeptides of the invention (including full length protein, mature 
protein and truncations or domains thereof), or compounds and other substances that 
modulate the overall activity of the target gene products, either at the level of target 

1 5 gene/protein expression or target protein activity. Such modulators include polypeptides, 
analogs, (variants), including fragments and fusion proteins, antibodies and other binding 
proteins; chemical compounds that directly or indirectly activate or inhibit the polypeptides 
of the invention (identified, e.g., via drug screening assays as described herein); antisense 
polynucleotides and polynucleotides suitable for triple helix formation; and in particular 

20 antibodies or other binding partners that specifically recognize one or more epitopes of the 
polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular 
activation or in one of the other physiological pathways described herein. 



25 4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 
protein for analysis, characterization or therapeutic use; as markers for tissues in which the 
corresponding protein is preferentially expressed (either constitutively or at a particular stage 
30 of tissue differentiation or development or in disease states); as molecular weight markers on 
gels; as chromosome markers or tags (when labeled) to identify chromosomes or to map 
related gene positions; to compare with endogenous DNA sequences in patients to identify 
potential genetic disorders; as probes to hybridize and thus discover novel, related DNA 
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sequences; as a source of information to derive PGR primers for genetic fingerprinting; as a 
probe to "subtract-out" known sequences in the process of discovering other novel 
polynucleotides; for selecting and making oligomers for attachment to a "gene chip" or other 
support, including for examination of expression patterns; to raise anti-protein antibodies 
5 using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or 

elicit another immune response. Where the polynucleotide encodes a protein which binds or 
potentially binds to another protein (such as, for example, in a receptor-ligand interaction), 
the polynucleotide can also be used in interaction trap assays (such as, for example, that 
described in Gyuris et aL, Cell 75:791-803 (1993)) to identify polynucleotides encoding the 

1 0 other protein with which binding occurs or to identify inhibitors of the binding interaction. 

The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including 
the labeled reagent) in assays designed to quantitatively determine levels of the protein (or 

15 its receptor) in biological fluids; as markers for tissues in which the corresponding 

polypeptide is preferentially expressed (either constitutively or at a particular stage of tissue 
differentiation or development or in a disease state); and, of course, to isolate correlative 
receptors or ligands. Proteins involved in these binding interactions can also be used to 
screen for peptide or small molecule inhibitors or agonists of the binding interaction. 

20 Any or all of these research utilities are capable of being developed into reagent 

grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the 
art. References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. 

25 Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular 
Cloning Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

* 

Polynucleotides and polypeptides of the present invention can also be used as 
30 nutritional sources or supplements. Such uses include without limitation use as a protein or 

amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source of 
carbohydrate. In such cases the polypeptide or polynucleotide of the invention can be added to 
the feed of a particular organism or can be administered as a separate solid or liquid 
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preparation, such as in the form of powder, pills, solutions, suspensions or capsules. In the 
case of microorganisms, the polypeptide or polynucleotide of the invention can be added to the 
medium in or on which the microorganism is cultured. 

5 4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 

ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or 
inhibiting) activity or may induce production of other cytokines in certain cell populations. 

10 A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. 
Many protein factors discovered to date, including all known cytokines, have exhibited 
activity in one or more factor-dependent cell proliferation assays, and hence the assays serve 
as a convenient confirmation of cytokine activity. The activity of therapeutic compositions 
of the present invention is evidenced by any one of a number of routine factor dependent cell 

15 proliferation assays for cell lines including, without limitation, 3 2D, DA2, DA1G, T-10, B9, 
B9/1 1, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, CTLL2, TF-1, 
Mo7e, CMK, HUVEC, and Caco. Therapeutic compositions of the invention can be used in 
the following: 

Assays for T-cell or thymocyte proliferation include without limitation those 
20 described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 

Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 

Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 

Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 
25 133:327-341, 1991; Bertagnolli, et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. 

Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells 

or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 

Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 
30 eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of 

mouse and human interleukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. 

e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 



WO 03/029271 PCT/US02/30474 

47 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. Li Current 
Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and 
5 Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., 

Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 
1983; Measurement of mouse and human interleukin 6-Nordan, R. In Current Protocols in 
Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 
Smith et al., Proc. Natl. Aced. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 

10 Interleukin 1 1-Bennett, F., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols 
in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9— Ciarletta, A., Giannotti, J., Clark, S. C. 
and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991. 

15 Assays for T-cell clone responses to antigens (which will identify, among others, 

proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, 
E. M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 

20 (Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and their 
cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. 
Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. hnmun. 11:405-411, 
1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988. 

25 

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor activity 
and be involved in the proliferation, differentiation and survival of pluripotent and totipotent 
stem cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells 
30 and/or germ line stem cells. Administration of the polypeptide of the invention to stem cells 
in vivo or ex vivo is expected to maintain and expand cell populations in a totipotential or 
pluripotential state which would be useful for re-engineering damaged or diseased tissues, 
transplantation, manufacture of bio-pharmaceuticals and the development of bio-sensors. 
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The ability to produce large quantities of human cells has important working applications 
for the production of human proteins which currently must be obtained from non-human 
sources or donors, implantation of cells to treat diseases such as Parkinson's, Alzheimer's 
and other neurodegenerative diseases; tissues for grafting such as bone marrow, skin, 
5 cartilage, tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural 
cells, gastrointestinal cells and others; and organs for transplantation such as kidney, liver, 
pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines 
may be administered in combination with the polypeptide of the invention to achieve the 

10 desired effect, including any of the growth factors listed herein, other stem cell maintenance 
factors, and specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), 
Flt-3 ligand (Flt-3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL- 
6, macrophage inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, 
thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth factor (PDGF), 

15 neural growth factors and basic fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion 
of these cells in culture will facilitate the production of large quantities of mature cells. 
Techniques for culturing stem cells are known in the art and administration of polypeptides 
of the invention, optionally with other growth factors and/or cytokines, is expected to 

20 enhance the survival and proliferation of the stem cell populations. This can be 

accomplished by direct administration of the polypeptide of the invention to the culture 
medium. Alternatively, stroma cells transfected with a polynucleotide that encodes for the 
polypeptide of the invention can be used as a feeder layer for the stem cell populations in 
culture or in vivo. Stromal support cells for feeder layers may include embryonic bone 

25 marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or cultured embryonic 
fibroblasts (see U.S. Patent No. 5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention to 
induce autocrine expression of the polypeptide of the invention. This will allow for 
generation of undifferentiated totipotential/pluripotential stem cell lines that are useful as is 

30 or that can then be differentiated into the desired mature cell types. These stable cell lines 
can also serve as a source of undifferentiated totipotential/pluripotential mRNA to create 
cDNA libraries and templates for polymerase chain reaction experiments. These studies 
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would allow for the isolation and identification of differentially expressed genes in stem 
cell populations that regulate stem cell proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present 
5 invention may be used to manipulate stem cells in culture to give rise to neuroepithelial cells 
that can be used to augment or replace cells damaged by illness, autoimmune disease, 
accidental damage or genetic disorders. The polypeptide of the invention may be useful for 
inducing the proliferation of neural cells and for the regeneration of nerve and brain tissue, 
i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as 

10 well as mechanical and traumatic disorders which involve degeneration, death or trauma to 
neural cells or nerve tissue. In addition, the expanded stem cell populations can also be 
genetically altered for gene therapy purposes and to decrease host rejection of replacement 
tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 

15 manipulated to achieve controlled differentiation of the stem cells into more differentiated 
cell types. A broadly applicable method of obtaining pure populations of a specific 
differentiated cell type from undifferentiated stem cell populations involves the use of a cell- 
type specific promoter driving a selectable marker. The selectable marker allows only cells 
of the desired type to survive. For example, stem cells can be induced to differentiate into 

20 cardiomyocytes (Wobus et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. 

Invest., 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. W. In: Principles of 
Tissue Engineering eds. Lanza et al., Academic Press (1997)). Alternatively, directed 
differentiation of stem cells can be accomplished by culturing the stem cells in the presence 
of a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the 

25 invention which would inhibit the effects of endogenous stem cell factor activity and allow 
differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the 
invention exhibits stem cell growth factor activity. Stem cells are isolated from any one of 
various cell sources (including hematopoietic stem cells and embryonic stem cells) and 

30 cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 
92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone or in 
combination with other growth factors or cytokines. The ability of the polypeptide of the 
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invention to induce stem cells proliferation is determined by colony formation on semi- 
solid support e.g. as described by Bernstein et ah, Blood, 77: 2316-2321 (1991). 

4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

5 A polypeptide of the present invention may be involved in regulation of 

hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. 
Even marginal biological activity in support of colony forming cells or of factor-dependent 
cell lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth 
and proliferation of erythroid progenitor cells alone or in combination with other cytokines, 

10 thereby indicating utility, for example, in treating various anemias or for use in conjunction 
with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or 
erythroid cells; in supporting the growth and proliferation of myeloid cells such as 
granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, 
in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in 

1 5 supporting the growth and proliferation of megakaryocytes and consequently of platelets 
thereby allowing prevention or treatment of various platelet disorders such as 
thrombocytopenia, and generally for use in place of or complimentary to platelet 
transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells 
which are capable of maturing to any and all of the above-mentioned hematopoietic cells and 

20 therefore find therapeutic utility in various stem cell disorders (such as those usually treated 
with transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or 

25 heterologous)) as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
30 proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al, 
Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 
1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Fresliney, M. G. In Culture of Hematopoietic Cells. 
R. I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; 
5 Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic 
colony forming cells with high proliferative potential, McNiece, I. K. and Briddell, R. A. In 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., 
New York, N.Y. 1994; Neben et al., Experimental Hematology 22:353-359, 1994; 
Cobblestone area forming cell assay, Ploemacher, R. E. In Culture of Hematopoietic Cells. 

10 R. I. Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long term 
bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and Allen, 
T. hi Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, 
Inc., New York, N.Y. 1994; Long term culture initiating cell assay, Sutherland, H. J. hi 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, Inc., 

15 New York, N.Y. 1994. 

4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, 
tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing and 

20 tissue repair and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone growth in 
circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 

25 prophylactic use in closed as well as open fracture reduction and also in the improved 
fixation of artificial joints. De novo bone formation induced by an osteogenic agent 
contributes to the repair of congenital, trauma induced, or oncologic resection induced 
craniofacial defects, and also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming 

30 cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by 
blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast 
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activity, etc.) mediated by inflammatory processes may also be possible using the 
composition of the invention. 

Another category of tissue regeneration activity that may involve the polypeptide of 
the present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue 
5 or other tissue formation in circumstances where such tissue is not normally formed, has 
application in the healing of tendon or ligament tears, deformities and other tendon or 
ligament defects in humans and other animals. Such a preparation employing a 
tendon/ligament-like tissue inducing protein may have prophylactic use in preventing 
damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or 

10 ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue. De 
novo tendon/ligament-like tissue formation induced by a composition of the present 
invention contributes to the repair of congenital, trauma induced, or other tendon or ligament 
defects of other origin, and is also useful in cosmetic plastic surgery for attachment or repair 
of tendons or ligaments. The compositions of the present invention may provide 

1 5 environment to attract tendon- or ligament- forming cells, stimulate growth of tendon- or 

ligament-forming cells, induce differentiation of progenitors of tendon- or ligament-forming 
cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to 
effect tissue repair. The compositions of the invention may also be useful in the treatment of 
tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The compositions 

20 may also include an appropriate matrix and/or sequestering agent as a carrier as is well 
known in the art. 

The compositions of the present invention may also be useful for proliferation of 
neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and 

25 traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a composition maybe used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 

30 syndrome. Further conditions which may be treated in accordance with the present invention 
include mechanical and traumatic disorders, such as spinal cord disorders, head trauma and 
cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from 
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chemotherapy or other medical therapies may also be treatable using a composition of the 
invention. 

Compositions of the invention may also be useful to promote better or faster closure 
of non-healing wounds, including without limitation pressure ulcers, ulcers associated with 
5 vascular insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, 
intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular 
(including vascular endothelium) tissue, or for promoting the growth of cells comprising 
1 0 such tissues. Part of the desired effects may be by inhibition or modulation of fibrotic 

scarring may allow normal tissue to regenerate. A polypeptide of the present invention may 
also exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
15 conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or 
inhibiting differentiation of tissues described above from precursor tissues or cells; or for 
inhibiting the growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
20 Assays for tissue generation activity include, without limitation, those described in: 

International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International 
Patent Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
W09 1/07491 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described in: 
25 Winter, Epidermal Wound Healing, pps. 71-1 12 (Maibach, H. I. and Rovee, D. T., eds.), 

Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

30 A polypeptide of the present invention may also exhibit immune stimulating or 

immune suppressing activity, including without limitation the activities for which assays are 
described herein. A polynucleotide of the invention can encode a polypeptide exhibiting 
such activities. A protein may be useful in the treatment of various immune deficiencies and 
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disorders (including severe combined immunodeficiency (SCID)), e.g., in regulating (up or 
down) growth and proliferation of T and/or B lymphocytes, as well as effecting the cytolytic 
activity of NK cells and other cell populations. These immune deficiencies maybe genetic or 
be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from 
5 autoimmune disorders. More specifically, infectious diseases causes by viral, bacterial, 

fungal or other infection may be treatable using a protein of the present invention, including 
infections by HIV, hepatitis viruses, herpes viruses, mycobacteria, Leishmania spp., malaria 
spp. and various fungal infections such as candidiasis. Of course, in this regard, proteins of 
the present invention may also be useful where a boost to the immune system generally may 

10 be desirable, i.e., in the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus 
erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre 
syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, 

15 graft-versus-host disease and autoimmune inflammatory eye disease. Such a protein (or 

antagonists thereof, including antibodies) of the present invention may also to be useful in 
the treatment of allergic reactions and conditions (e.g., anaphylaxis, serum sickness, drug 
reactions, food allergies, insect venom allergies, mastocytosis, allergic rhinitis, 
hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic dermatitis, allergic 

20 contact dermatitis, erythema multiforme, Stevens- Johnson syndrome, allergic conjunctivitis, 
atopic keratoconjunctivitis, venereal keratoconjunctivitis, giant papillary conjunctivitis and 
contact allergies), such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
example, organ transplantation), may also be treatable using a protein (or antagonists 

25 thereof) of the present invention. The therapeutic effects of the polypeptides or antagonists 
thereof on allergic reactions can be evaluated by in vivo animals models such as the 
cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 1998), skin 
prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization test 
(Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 

30 J. Toxicol. Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune 
responses, in a number of ways. Down regulation may be in the form of inhibiting or 
blocking an immune response already in progress or may involve preventing the induction of 
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an immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression of T 
cell responses is generally an active, non-antigen-specific, process which requires continuous 
exposure of the T cells to the suppressive agent. Tolerance, which involves inducing 
5 non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that 
it is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. 
Operationally, tolerance can be demonstrated by the lack of a T cell response upon 
reexposure to specific antigen in the absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 

1 0 limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin 
and organ transplantation and in graft- versus-host disease (GVHD). For example, blockage 
of T cell function should result in reduced tissue destruction in tissue transplantation. 
Typically, in tissue transplants, rejection of the transplant is initiated through its recognition 

15 as foreign by T cells, followed by an immune reaction that destroys the transplant. The 

administration of a therapeutic composition of the invention may prevent cytokine synthesis 
by immune cells, such as T cells, and thus acts as an immunosuppressant. Moreover, a lack 
of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in 
a subject. Induction of long-term tolerance by B lymphocyte antigen-blocking reagents may 

20 avoid the necessity of repeated administration of these blocking reagents. To achieve 

sufficient immunosuppression or tolerance in a subject, it may also be necessary to block the 
function of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 

25 humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. 
Sci USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 

30 Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 

determine the effect of therapeutic compositions of the invention on the development of that 
disease. 
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Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate activation 
of T cells that are reactive against self-tissue and which promote the production of cytokines 
and autoantibodies involved in the pathology of the diseases. Preventing the activation of 
5 autoreactive T cells may reduce or eliminate disease symptoms. Administration of reagents 
which block stimulation of T cells can be used to inhibit T cell activation and prevent 
production of autoantibodies or T cell-derived cytokines which may be involved in the 
disease process. Additionally, blocking reagents may induce antigen-specific tolerance of 
autoreactive T cells which could lead to long-term relief from the disease. The efficacy of 
10 blocking reagents in preventing or alleviating autoimmune disorders can be determined 

« 

using a number of well-characterized animal models of human autoimmune diseases. 
Examples include murine experimental autoimmune encephalitis, systemic lupus 
erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen 
arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia 
15 gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a 
means of up regulating immune responses, may also be useful in therapy. Upregulation of 
immune responses may be in the form of enhancing an existing immune response or eliciting 
20 an initial immune response. For example, enhancing an immune response may be useful in 
cases of viral infection, including systemic viral diseases such as influenza, the common 

cold, and encephalitis. 

Alternatively, anti-viral immune responses maybe enhanced in an infected patient by 
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 

25 APCs either expressing a peptide of the present invention or together with a stimulatory 

form of a soluble peptide of the present invention and reintroducing the in vitro activated T 
cells into the patient. Another method of enhancing anti- viral immune responses would be to 
isolate infected cells from a patient, transfect them with a nucleic acid encoding a protein of 
the present invention as described herein such that the cells express all or a portion of the 

30 protein on their surface, and reintroduce the transfected cells into the patient. The infected 
cells would now be capable of delivering a costimulatory signal to, and thereby activate, T 
cells in vivo. 
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A polypeptide of the present invention may provide the necessary stimulation signal 
to T cells to induce a T cell mediated immune response against the transfected tumor cells. 
In addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 
reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected 
5 with nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) 
of an MHC class I alpha chain protein and p 2 microglobulin protein or an MHC class II 
alpha chain protein and an MHC class II beta chain protein to thereby express MHC class I 
or MHC class II proteins on the cell surface. Expression of the appropriate class I or class II 
MHC in conjunction with a peptide having the activity of a B lymphocyte antigen (e.g., 

10 B7-1, B7-2, B7-3) induces a T cell mediated immune response against the transfected tumor 
cell. Optionally, a gene encoding an antisense construct which blocks expression of an MHC 
class II associated protein, such as the invariant chain, can also be cotransfected with a DNA 
encoding a peptide having the activity of a B lymphocyte antigen to promote presentation of 
tumor associated antigens and induce tumor specific immunity. Thus, the induction of a T 

1 5 cell mediated immune response in a human subject may be sufficient to overcome 
tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by 

the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 

20 those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 
D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 

25 Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., 
J. Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et 
al., Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 
1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching 
30 (which will identify, among others, proteins that modulate T-cell dependent antibody 

responses and that affect Thl/Th2 profiles) include, without limitation, those described in: 
Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro 
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antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. 
J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without limitation, 
5 those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 
D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 
10 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 

15 154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 
1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 
264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 
1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., 
Journal of Experimental Medicine 172:631-640, 1990. 

20 Assays for lymphocyte survival/apoptosis (which will identify, among others, 

proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et 
al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, 

25 Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; 
Gorczyca et al., International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and development 
include, without limitation, those described in: Antica et al., Blood 84:111-117, 1994; Fine 
et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; 

30 Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 



4.10.8 ACTIVIN/INHIBIN ACTIVITY 
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A polypeptide of the present invention may also exhibit activin- or inhibin-related 
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate 
5 the release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present 

invention, alone or in heterodimers with a member of the inhibin family, may be useful as a 
contraceptive based on the ability of inhibins to decrease fertility in female mammals and 
decrease spermatogenesis in male mammals. Administration of sufficient amounts of other 
inhibins can induce infertility in these mammals. Alternatively, the polypeptide of the 

1 0 invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin 
group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 
U.S. Pat. No. 4,798,885. A polypeptide of the invention may also be useful for advancement 
of the onset of fertility in sexually immature mammals, so as to increase the lifetime 

15 reproductive performance of domestic animals such as, but not limited to, cows, sheep and 
pigs. 

The activity of a polypeptide of the invention may, among other means, be measured 

by the following methods. 

Assays for activin/inhibin activity include, without limitation, those described in: 
20 Vale et al., Endocrinology 91 :562-572, 1972; Ling et al., Nature 321 :779-782, 1986; Vale et 
al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. 
Natl. Acad. Sci. USA 83:3091-3095, 1986. 

4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

25 A polypeptide of the present invention may be involved in chemotactic or 

chemokinetic activity for mammalian cells, including, for example, monocytes, fibroblasts, 
neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. 
Chemotactic and chemokinetic receptor activation can be used to mobilize or attract a 

30 desired cell population to a desired site of action. Chemotactic or chemokinetic compositions 
(e.g. proteins, antibodies, binding partners, or modulators of the invention) provide particular 
advantages in treatment of wounds and other trauma to tissues, as well as in treatment of 
localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to 
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tumors or sites of infection may result in improved immune responses against the tumor or 
infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population of 
cells can be readily determined by employing such protein or peptide in any known assay for 
cell chemotaxis. 

Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. 
Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta 
Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994. 

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Compositions may be useful in treatment of various coagulation disorders 
(including hereditary disorders, such as hemophilias) or to enhance coagulation and other 
hemostatic events in treating wounds resulting from trauma, surgery or other causes. A 
composition of the invention may also be useful for dissolving or inhibiting formation of 
thromboses and for treatment and prevention of conditions resulting therefrom (such as, for 
example, infarction of cardiac and central nervous system vessels (e.g., stroke). 
Therapeutic compositions of the invention can be used in the following: 
Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
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Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, 
Prostaglandins 35:467-474, 1988. 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

5 Polypeptides of the invention may be involved in cancer cell generation, proliferation 

or metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. 
For example, the presence or increased expression of a polynucleotide/polypeptide of the 
invention may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing 
1 0 malignancy. Conversely, a defect in the gene or absence of the polypeptide may be 

associated with a cancer condition. Identification of single nucleotide polymorphisms 
associated with cancer or a predisposition to cancer may also be useful for diagnosis or 
prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 

15 inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor 
growth) and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. 
Therapeutic compositions of the invention may be effective in adult and pediatric oncology 
including in solid phase tumors/malignancies, locally advanced tumors, human soft tissue 
sarcomas, metastatic cancer, including lymphatic metastases, blood cell malignancies 

20 including multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck 
cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers including 
small cell carcinoma and non-small cell cancers, breast cancers including small cell 
carcinoma and ductal carcinoma, gastrointestinal cancers including esophageal cancer, 
stomach cancer, colon cancer, colorectal cancer and polyps associated with colorectal 

25 neoplasia, pancreatic cancers, liver cancer, urologic cancers including bladder cancer and 

prostate cancer, malignancies of the female genital tract including ovarian carcinoma, uterine 
(including endometrial) cancers, and solid tumor in the ovarian follicle, kidney cancers 
including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 

30 nervous system, bone cancers including osteomas, skin cancers including malignant 

melanoma, tumor progression of human skin keratinocytes, squamous cell carcinoma, basal 
cell carcinoma, hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention 
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(including inhibitors and stimulators of the biological activity of the polypeptide of the 
invention) may be administered to treat cancer. Therapeutic compositions can be 
administered in therapeutically effective dosages alone or in combination with adjuvant 
cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser 
5 therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of tumor 
growth, inhibiting metastasis, or otherwise improving overall clinical condition, without 

necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as a 
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 

1 0 modulator of the invention with one or more anti-cancer drugs in addition to a 

pharmaceutically acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer 
treatment is routine. Anti-cancer drugs that are well known in the art and can be used as a 
treatment in combination with the polypeptide or modulator of the invention include: 
Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, 

1 5 Carmustine, Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, Cytarabine HC1 

(Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HC1, Doxorubicin HC1, 
Estramustine phosphate sodium, Etoposide (V16-213), Floxuridine, 5-Fluorouracil (5-Fu), 
Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon 
Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine 

20 HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), 

Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, Streptozocin, 
Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfate. 

25 hi addition, therapeutic compositions of the invention may be used for prophylactic 

treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing 
cancers. Under these circumstances, it may be beneficial to treat these individuals with 
therapeutically effective doses of the polypeptide of the invention to reduce the risk of 

30 developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays 
of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) 
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Culture of Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 
and Ch 21), tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. hist., 
52: 921-30 (1974), mobility and invasive potential of tumor cells in Boyden Chamber assays 
as described in Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis 
assays such as induction of vascularization of the chick chorioallantoic membrane or 
induction of vascular endothelial cell migration as described in Ribatta et al., Intl. J. Dev. 
Biol., 40: 1189-97 (1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999), respectively. 
Suitable tumor cells lines are available, e.g. from American Type Tissue Culture Collection 
catalogs. 

4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 

i 

receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of 
the invention can encode a polypeptide exhibiting such characteristics. Examples of such 
receptors and ligands include, without limitation, cytokine receptors and their ligands, 
receptor kinases and their ligands, receptor phosphatases and their ligands, receptors 
involved in cell-cell interactions and their ligands (including without limitation, cellular 
adhesion molecules (such as selectins, integrins and their ligands) and receptor/ligand pairs 
involved in antigen presentation, antigen recognition and development of cellular and 
humoral immune responses. Receptors and ligands are also useful for screening of potential 
peptide or small molecule inhibitors of the relevant receptor/ligand interaction. A protein of 
the present invention (including, without limitation, fragments of receptors and ligands) may 
themselves be useful as inhibitors of receptor/ligand interactions. 

The activity of a polypeptide of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described 
in: Current Protocols in Immunology, Ed by I. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 
7.28.1- 7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., 
J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 1989; 
Stoltenborg et al., I. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 
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By way of example, the polypeptides of the invention may be used as a receptor for 
a ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be 
identified through binding assays, affinity chromatography, dihybrid screening assays, 
BIAcore assays, gel overlay assays, or other methods known in the art. 
5 Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or 

a partial antagonist require the use of other proteins as competing ligands. The polypeptides 
of the present invention or ligand(s) thereof may be labeled by being coupled to 
radioisotopes, colorimetric molecules or a toxin molecules by conventional methods. 
("Guide to Protein Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 
10 (1990) Academic Press, Inc. San Diego). Examples of radioisotopes include, but are not 

limited to, tritium and carbon- 14 . Examples of colorimetric molecules include, but are not 
limited to, fluorescent molecules such as fluorescamine, or rhodamine or other colorimetric 
molecules. Examples of toxins include, but are not limited, to ricin. 



15 4.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening 
techniques. The polypeptides or fragments employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 

20 method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the polypeptide or a fragment 
thereof. Drugs are screened against such transformed cells in competitive binding assays. 
Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, the formation of complexes between polypeptides of the invention or 

25 fragments and the agent being tested or examine the diminution in complex formation 

between the novel polypeptides and an appropriate cell line, which are well known in the art. 

Sources for test compounds that may be screened for ability to bind to or modulate 
(i.e., increase or decrease) the activity of polypeptides of the invention include (1) inorganic 
and organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 

30 comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 
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The sources of natural product libraries are microorganisms (including bacteria and 
fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 
screening may be created by: (1) fermentation and extraction of broths from soil, plant or 
marine microorganisms or (2) extraction of the organisms themselves. Natural product 
5 libraries include polyketides, non-ribosomal peptides, and (non-naturally occurring) variants 
thereof For a review, see Science 282:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides 
or organic compounds and can be readily prepared by traditional automated synthesis 
methods, PGR, cloning or proprietary synthetic methods. Of particular interest are peptide 

10 and oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, 
protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide 
libraries. For a review of combinatorial chemistry and libraries created therefrom, see 
Myers, Curr. Opin. BiotechnoL 8:701-707 (1997). For reviews and examples of 
peptidomimetic libraries, see Al-Obeidi et al., MoL Biotechnol, 9(3):205-23 (1998); Hruby 

15 et al., Curr Opin Chem Biol, 1(1):114-19 (1997); Dorner et al., BioorgMed Chem, 
4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein 
permits modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" 
to bind a polypeptide of the invention. The molecules identified in the binding assay are then 

20 tested for antagonist or agonist activity in in vivo tissue culture or animal models that are 
well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or 
animals and then tested for either cell/animal death or prolonged survival of the animal/cells. 

The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The 

25 toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of 
the binding molecule for a polypeptide of the invention. Alternatively, the binding 
molecules may be complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

30 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For 
example, expression cloning using mammalian or bacterial cells, or dihybrid screening 
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assays can be used to identify polynucleotides encoding binding partners. As another 
example, affinity chromatography with the appropriate immobilized polypeptide of the 
invention can be used to isolate polypeptides that recognize and bind polypeptides of the 
invention. There are a number of different libraries used for the identification of 
5 compounds, and in particular small molecules, that modulate {i.e., increase or decrease) 

biological activity of a polypeptide of the invention. Ligands for receptor polypeptides of the 
invention can also be identified by adding exogenous ligands, or cocktails of ligands to two 
cells populations that are genetically identical except for the expression of the receptor of the 
invention: one cell population expresses the receptor of the invention whereas the other does 

10 not. The responses of the two cell populations to the addition of ligands(s) are then 

compared. Alternatively, an expression library can be co-expressed with the polypeptide of 
the invention in cells and assayed for an autocrine response to identify potential ligand(s). As 
still another example, BIAcore assays, gel overlay assays, or other methods known in the art 
can be used to identify binding partner polypeptides, including, (1) organic and inorganic 

15 chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of 
random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of 
the polypeptide of the invention can be determined. For example, a chimeric protein in 
which the cytoplasmic domain of the polypeptide of the invention is fused to the 

20 extracellular portion of a protein, whose ligand has been identified, is produced in a host 
cell. The cell is then incubated with the ligand specific for the extracellular portion of the 
chimeric protein, thereby activating the chimeric receptor. Known downstream proteins 
involved in intracellular signaling can then be assayed for expected modifications i.e. 
phosphorylation. Other methods known to those in the art can also be used to identify 

25 signaling molecules involved in receptor activity. 

4.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. 
The anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 
30 the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
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inflammatory response. Compositions with such activities can be used to treat 
inflammatory conditions including chronic or acute conditions), including without limitation 
intimation associated with infection (such as septic shock, sepsis or systemic inflammatory 
response syndrome (SIRS)), ischemia-rep er fusion injury, endotoxin lethality, arthritis, 
5 complement-mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung 
injury, inflammatory bowel disease, Crohn's disease or resulting from over production of 
cytokines such as TNF or IL-1. Compositions of the invention may also be useful to treat 
anaphylaxis and hypersensitivity to an antigenic substance or material. Compositions of this 
invention may be utilized to prevent or treat conditions such as, but not limited to, sepsis, 

10 acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic 
inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft versus 
host disease, inflammatory bowel disease, inflamation associated with pulmonary disease, 
other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 
acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 

15 intrauterine infections. 

4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of 
20 the invention. Such leukemias and related disorders include but are not limited to acute 
leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, 
promyelocytic, myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic 
myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such 
disorders, see Fishman et ah, 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 

25 

4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
30 therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient 
(including human and non-human mammalian patients) according to the invention include 
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but are not limited to the following lesions of either the central (including spinal cord, 
brain) or peripheral nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated 
with surgery, for example, lesions which sever a portion of the nervous system, or 
compression injuries; 

(ii) . ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 
infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by 
human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme 
disease, tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration 
associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or 
amyotrophic lateral sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of 
the nervous system is destroyed or injured by a nutritional disorder or disorder of 
metabolism including but not limited to, vitamin B12 deficiency, folic acid deficiency, 
Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary 
degeneration of the corpus callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not 
limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, 
carcinoma, or sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 
immunodeficiency virus-associated myelopathy, transverse myelopathy or various 
etiologies, progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival 
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or differentiation of neurons. For example, and not by way of limitation, therapeutics 
which elicit any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, 
e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non- limiting embodiments, increased survival of neurons may be measured by the method 
set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons 
may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or 
Brown et al. (1981, Ann. Rev. Neurosci. 4:17-42); increased production of 
neuron-associated molecules may be measured by bioassay, enzymatic assay, antibody 
binding, Northern blot assay, etc., depending on the molecule to be measured; and motor 
neuron dysfunction may be measured by assessing the physical manifestation of motor 
neuron disorder, e.g., weakness, motor neuron conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 
invention include but are not limited to disorders such as infarction, infection, exposure to 
toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor 
neurons as well as other components of the nervous system, as well as disorders that 
selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited 
to progressive spinal muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, 
infantile and juvenile muscular atrophy, progressive bulbar paralysis of childhood (Fazio- 
Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary Motorsensory 
Neuropathy (Charcot-Marie-Tooth Disease). 

4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following 
additional activities or effects: inhibiting the growth, infection or function of, or killing, 
infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; 
effecting (suppressing or enhancing) bodily characteristics, including, without limitation, 
height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or 
organ or body part size or shape (such as, for example, breast augmentation or diminution, 
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change in bone form or shape); effecting biorhythms or circadian cycles or rhythms; 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 
carbohydrate, vitamins, minerals, co-factors or other nutritional factors or component(s); 
5 effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies of 
10 the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen in a 
vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

15 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for 
diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential 

20 predisposition or susceptibility to various disease states (such as disorders involving 

inflammation or immune response) or a differential response to drug administration, and this 
genetic information can be used to tailor preventive or therapeutic treatment appropriately. 
For example, the existence of a polymorphism associated with a predisposition to 
inflammation or autoimmune disease makes possible the diagnosis of this condition in 

25 humans by identifying the presence of the polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, 
optionally involving isolation or amplification of the DNA, and identifying the presence of 
the polymorphism in the DNA. For example, PCR may be used to amplify an appropriate 

30 fragment of genomic DNA which may then be sequenced. Alternatively, the DNA may be 
subjected to allele-specific oligonucleotide hybridization (in which appropriate 
oligonucleotides are hybridized to the DNA under conditions permitting detection of a single 
base mismatch) or to a single nucleotide extension assay (in which an oligonucleotide that 
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hybridizes immediately adjacent to the position of the polymorphism is extended with one 
or more labeled nucleotides). In addition, traditional restriction fragment length 
polymorphism analysis (using restriction enzymes that provide differential digestion of the 
genomic DNA depending on the presence or absence of the polymorphism) may be 
5 performed. Arrays with nucleotide sequences of the present invention can be used to detect 
polymorphisms. The array can comprise modified nucleotide sequences of the present 
invention in order to detect the nucleotide sequences of the present invention. In the 
alternative, any one of the nucleotide sequences of the present invention can be placed on the 
array to detect changes from those sequences. 
10 Alternatively a polymorphism resulting in a change in the amino acid sequence could 

also be detected by detecting a corresponding change in amino acid sequence of the protein, 
e-g*> by an antibody specific to the variant sequence. 

4.10.20 ARTHRITIS AND INFLAMMATION 

15 The immunosuppressive effects of the compositions of the invention against 

rheumatoid arthritis is determined in an experimental animal model system. The 
experimental model system is adjuvant induced arthritis in rats, and the protocol is described 
by J. Holoshitz, et at, 1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. 
Allergy Appl. Immunol., 23:129. Induction of the disease can be caused by a single 

20 injection, generally intradermally, of a suspension of killed Mycobacterium tuberculosis in 
complete Freund's adjuvant (CFA). The route of injection can vary, but rats may be injected 
at the base of the tail with an adjuvant mixture. The polypeptide is administered in phosphate 
buffered solution (PBS) at a dose of about 1-5 mg/kg. The control consists of administering 
PBS only. 

25 The procedure for testing the effects of the test compound would consist of 

intradermally injecting killed Mycobacterium tuberculosis in CFA followed by immediately 
administering the test compound and subsequent treatment every other day until day 24. At 
14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium CFA, an overall arthritis 
score may be obtained as described by J. Holoskitz above. An analysis of the data would 

30 reveal that the test compound would have a dramatic affect on the swelling of the joints as 
measured by a decrease of the arthritis score. 



4.11 THERAPEUTIC METHODS 
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The compositions (including polypeptide fragments, analogs, variants and 
antibodies or other binding partners or modulators including antisense polynucleotides) of 
the invention have numerous applications in a variety of therapeutic methods. Examples of 
therapeutic applications include, but are not limited to, those exemplified herein. 

5 

4.11.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode 

10 of administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, 
weight, condition and response of the individual patient. Typically, the amount of 

15 polypeptide administered per dose will be in the range of about 0.01 jug/kg to 100 mg/kg of 
body weight, with the preferred dose being about O.ljag/kg to 10 mg/kg of patient body 
weight. For parenteral administration, polypeptides of the invention will be formulated in an 
injectable form combined with a pharmaceutically acceptable parenteral vehicle. Such 
vehicles are well known in the art and examples include water, saline, Ringer's solution, 

20 dextrose solution, and solutions consisting of small amounts of the human serum albumin. 
The vehicle may contain minor amounts of additives that maintain the isotonicity and 
stability of the polypeptide or other active ingredient. The preparation of such solutions is 
within the skill of the art. 

25 4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 

ADMINISTRATION 

A protein or other composition of the present invention (from whatever source 
derived, including without limitation from recombinant and non-recombinant sources and 
including antibodies and other binding partners of the polypeptides of the invention) may be 
30 administered to a patient in need, by itself, or in pharmaceutical compositions where it is 
mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of 
disorders. Such a composition may optionally contain (in addition to protein or other active 
ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and other 
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materials well known in the art. The term "pharmaceutically acceptable" means a non-toxic 
material that does not interfere with the effectiveness of the biological activity of the active 
ingredient(s). The characteristics of the carrier will depend on the route of administration. 
The pharmaceutical composition of the invention may also contain cytokines, lymphokines, 
or other hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, 
IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNF1, TNF2, 
G-CSF, Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin. In further 
compositions, proteins of the invention may be combined with other agents beneficial to the 
treatment of the disease or disorder in question. These agents include various growth factors 
such as epidermal growth factor (EGF), platelet-derived growth factor (PDGF), transforming 
growth factors (TGF-oc and TGF-P), insulin-like growth factor (IGF), as well as cytokines 
described herein. 

The pharmaceutical composition may further contain other agents which either 
enhance the activity of the protein or other active ingredient or complement its activity or 
use in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein or other active 
ingredient of the invention, or to minimize side effects. Conversely, protein or other active 
ingredient of the present invention may be included in formulations of the particular clotting 
factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti- inflammatory agent to minimize side effects of the clotting factor, cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or 
anti-inflammatory agent (such as IL-IRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, 
immunosuppressive agents). A protein of the present invention may be active in multimers 
(e.g., heterodimers or homodimers) or complexes with itself or other proteins. As a result, 
pharmaceutical compositions of the invention may comprise a protein of the invention in 
such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 
Techniques for formulation and administration of the compounds of the instant application 
may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, 
latest edition. A therapeutically effective dose further refers to that amount of the compound 
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sufficient to result in amelioration of symptoms, e.g., treatment, healing, prevention or 
amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, a therapeutically effective dose refers to that ingredient 
5 alone. When applied to a combination, a therapeutically effective dose refers to combined 
amounts of the active ingredients that result in the therapeutic effect, whether administered 
in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein or other active ingredient of the present invention 

10 is administered to a mammal having a condition to be treated. Protein or other active 

ingredient of the present invention may be administered in accordance with the method of 
the invention either alone or in combination with other therapies such as treatments 
employing cytokines, lymphokines or other hematopoietic factors. When co- administered 
with one or more cytokines, lymphokines or other hematopoietic factors, protein or other 

1 5 active ingredient of the present invention may be administered either simultaneously with 
the cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or 
anti-thrombotic factors, or sequentially. If administered sequentially, the attending physician 
will decide on the appropriate sequence of administering protein or other active ingredient of 
the present invention in combination with cytokine(s), lymphokine(s), other hematopoietic 

20 factor(s), thrombolytic or anti-thrombotic factors. 

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, or intestinal administration; parenteral delivery, including intramuscular, 

25 subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 

intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of protein 
or other active ingredient of the present invention used in the pharmaceutical composition or 
to practice the method of the present invention can be carried out in a variety of conventional 
ways, such as oral ingestion, inhalation, topical application or cutaneous, subcutaneous, 

30 intraperitoneal, parenteral or intravenous injection. Intravenous administration to the patient 
is preferred. 

Alternately, one may administer the compound in a local rather than systemic 
manner, for example, via injection of the compound directly into a arthritic joints or in 
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fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the 
scarring process frequently occurring as complication of glaucoma surgery, the compounds 
may be administered topically, for example, as eye drops. Furthermore, one may administer 
the drug in a targeted drug delivery system, for example, in a liposome coated with a specific 
5 antibody, targeting, for example, arthritic or fibrotic tissue. The liposomes will be targeted 
to and taken up selectively by the afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an 
effective dosage to the desired site of action. The determination of a suitable route of 
administration and an effective dosage for a particular indication is within the level of skill 
10 in the art. Preferably for wound treatment, one administers the therapeutic compound 
directly to the site. Suitable dosage ranges for the polypeptides of the invention can be 
extrapolated from these dosages or from similar studies in appropriate animal models. 
Dosages can then be adjusted as necessary by the clinician to provide maximal therapeutic 
benefit. 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus 
may be formulated in a conventional manner using one or more physiologically acceptable 
earners comprising excipients and auxiliaries which facilitate processing of the active 

20 compounds into preparations which can be used pharmaceutically. These pharmaceutical 
compositions may be manufactured in a manner that is itself known, e.g., by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, 
encapsulating, entrapping or lyophilizing processes. Proper formulation is dependent upon 
the route of administration chosen. When a therapeutically effective amount of protein or 

25 other active ingredient of the present invention is administered orally, protein or other active 
ingredient of the present invention will be in the form of a tablet, capsule, powder, solution 
or elixir. When administered in tablet form, the pharmaceutical composition of the invention 
may additionally contain a solid carrier such as a gelatin or an adjuvant. The tablet, capsule, 
and powder contain from about 5 to 95% protein or other active ingredient of the present 

30 invention, and preferably from about 25 to 90% protein or other active ingredient of the 
present invention. When administered in liquid form, a liquid carrier such as water, 
petroleum, oils of animal or plant origin such as peanut oil, mineral oil, soybean oil, or 
sesame oil, or synthetic oils may be added. The liquid form of the pharmaceutical 
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composition may further contain physiological saline solution, dextrose or other saccharide 
solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. When 
administered in liquid form, the pharmaceutical composition contains from about 0.5 to 90% 
by weight of protein or other active ingredient of the present invention, and preferably from 
5 about 1 to 50% protein or other active ingredient of the present invention. 

When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered by intravenous, cutaneous or subcutaneous injection, 
protein or other active ingredient of the present invention will be in the form of a 
pyrogen- free, parenterally acceptable aqueous solution. The preparation of such parenterally 

10 acceptable protein or other active ingredient solutions, having due regard to pH, isotonicity, 
stability, and the like, is within the skill in the art. A preferred pharmaceutical composition 
for intravenous, cutaneous, or subcutaneous injection should contain, in addition to protein 
or other active ingredient of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer ? s Injection, Dextrose Injection, Dextrose and Sodium Chloride 

15 Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 
pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. For 
injection, the agents of the invention may be formulated in aqueous solutions, preferably in 
physiologically compatible buffers such as Hanks f s solution, Ringer's solution, or 

20 physiological saline buffer. For transmucosal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are generally known in 
the art. 

For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutically acceptable carriers well known in the art. Such 

25 carriers enable the compounds of the invention to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a 
patient to be treated. Pharmaceutical preparations for oral use can be obtained from a solid 
excipient, optionally grinding a resulting mixture, and processing the mixture of granules, 
after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable 

30 excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or 
sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, 
sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, 
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disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or 
alginic acid or a salt thereof such as sodium alginate. Dragee cores are provided with 
suitable coatings. For this purpose, concentrated sugar solutions may be used, which may 
optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, 
5 and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made 
of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol 

10 or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler 
such as lactose, binders such as starches, and/or lubricants such as talc or magnesium 
stearate and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved 
or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene 
glycols. In addition, stabilizers may be added. All formulations for oral administration 

15 should be in dosages suitable for such administration. For buccal administration, the 

compositions may take the form of tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 

20 dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 
or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin 
for use in an inhaler or insufflator may be formulated containing a powder mix of the 
compound and a suitable powder base such as lactose or starch. The compounds may be 

25 formulated for parenteral administration by injection, e.g. , by bolus injection or continuous 
infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampules 
or in multi-dose containers, with an added preservative. The compositions may take such 
forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain 
formulatory agents such as suspending, stabilizing and/or dispersing agents. 

30 Pharmaceutical formulations for parenteral administration include aqueous solutions 

of the active compounds in water-soluble form. Additionally, suspensions of the active 
compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such 
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as ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the compounds to allow for the 
5 preparation of highly concentrated solutions. Alternatively, the active ingredient may be in 
powder form for constitution with a suitable vehicle, e.g., sterile pyrogen- free water, before 
use. 

The compounds may also be formulated in rectal compositions such as suppositories 
or retention enemas, e.g. , containing conventional suppository bases such as cocoa butter or 

1 0 other glycerides. In addition to the formulations described previously, the compounds may 
also be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with suitable 
polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion 

1 5 exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co- 
solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic 
polymer, and an aqueous phase. The co-solvent system may be the VPD co-solvent system. 
VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 

20 80, and 65% w/v polyethylene glycol 300, made up to volume in absolute ethanol. The VPD 
co-solvent system (VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water 
solution. This co-solvent system dissolves hydrophobic compounds well, and itself produces 
low toxicity upon systemic administration. Naturally, the proportions of a co-solvent system 
may be varied considerably without destroying its solubility and toxicity characteristics. 

25 Furthermore, the identity of the co-solvent components may be varied: for example, other 

low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction size of 
polyethylene glycol may be varied; other biocompatible polymers may replace polyethylene 
glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for 
dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical compounds 

30 may be employed. Liposomes and emulsions are well known examples of delivery vehicles 
or carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also 
may be employed, although usually at the cost of greater toxicity. Additionally, the 
compounds may be delivered using a sustained-release system, such as semipermeable 
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matrices of solid hydrophobic polymers containing the therapeutic agent. Various types of 
sustained-release materials have been established and are well known by those skilled in the 
art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
5 biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not limited to 
calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, 

10 gelatin, and polymers such as polyethylene glycols. Many of the active ingredients of the 
invention may be provided as salts with pharmaceutically compatible counter ions. Such 
pharmaceutically acceptable base addition salts are those salts which retain the biological 
effectiveness and properties of the free acids and which are obtained by reaction with 
inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, 

15 trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate, 
potassium benzoate, triethanol amine and the like. 

The pharmaceutical composition of the invention may be in the form of a complex of 
the protein(s) or other active ingredient(s) of present invention along with protein or peptide 
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 

20 lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) 
following presentation of the antigen by MHC proteins. MHC and structurally related 
proteins including those encoded by class I and class II MHC genes on host cells will serve 
to present the peptide antigen(s) to T lymphocytes. The antigen components could also be 

25 supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that 
can directly signal T cells. Alternatively antibodies able to bind surface immunoglobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the invention. 
The pharmaceutical composition of the invention may be in the form of a liposome in 

30 which protein of the present invention is combined, in addition to other pharmaceutically 

acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. 
Suitable lipids for liposomal formulation include, without limitation, monoglycerides, 
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diglycerides, sulfatides, lysolecithins 5 phospholipids, saponin, bile acids, and the like. 
Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, 
for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of 
which are incorporated herein by reference. 
5 The amount of protein or other active ingredient of the present invention in the 

pharmaceutical composition of the present invention will depend upon the nature and 
severity of the condition being treated, and on the nature of prior treatments which the 
patient has undergone. Ultimately, the attending physician will decide the amount of protein 
or other active ingredient of the present invention with which to treat each individual patient. 

1 0 Initially, the attending physician will administer low doses of protein or other active 
ingredient of the present invention and observe the patient's response. Larger doses of 
protein or other active ingredient of the present invention maybe administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased farther. It is contemplated that the various pharmaceutical compositions used to 

15 practice the method of the present invention should contain about 0.01 fig to about 100 mg 
(preferably about 0.1 jug to about 10 mg, more preferably about 0.1 )Lig to about 1 mg) of 
protein or other active ingredient of the present invention per kg body weight. For 
compositions of the present invention which are useful for bone, cartilage, tendon or 
ligament regeneration, the therapeutic method includes administering the composition 

20 topically, systematically, or locally as an implant or device. When administered, the 
therapeutic composition for use in this invention is, of course, in a pyrogen-free, 
physiologically acceptable form. Further, the composition may desirably be encapsulated or 
injected in a viscous form for delivery to the site of bone, cartilage or tissue damage. 
Topical administration may be suitable for wound healing and tissue repair. Therapeutically 

25 useful agents other than a protein or other active ingredient of the invention which may also 
optionally be included in the composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and/or cartilage formation, the composition 
would include a matrix capable of delivering the protein-containing or other active 

30 ingredient-containing composition to the site of bone and/or cartilage damage, providing a 
structure for the developing bone and cartilage and optimally capable of being resorbed into 
the body. Such matrices may be formed of materials presently in use for other implanted 
medical applications. 
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The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential matrices 
for the compositions may be biodegradable and chemically defined calcium sulfate, 
5 tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. 
Other potential materials are biodegradable and biologically well-defined, such as bone or 
dermal collagen. Further matrices are comprised of pure proteins or extracellular matrix 
components. Other potential matrices are nonbiodegradable and chemically defined, such as 
sintered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised 

10 of combinations of any of the above-mentioned types of material, such as polylactic acid and 
hydroxyapatite or collagen and tricalcium phosphate. The bioceramics may be altered in 
composition, such as in calcium-aluminate-phosphate and processing to alter pore size, 
particle size, particle shape, and biodegradability. Presently preferred is a 50:50 (mole 
weight) copolymer of lactic acid and glycolic acid in the form of porous particles having 

15 diameters ranging from 150 to 800 microns. In some applications, it will be useful to utilize 
a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent 
the protein compositions from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 

20 ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, 

hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being 
cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents 
include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, 
carboxyvinyl polymer and polyvinyl alcohol). The amount of sequestering agent useful 

25 herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which 
represents the amount necessary to prevent desorption of the protein from the polymer 
matrix and to provide appropriate handling of the composition, yet not so much that the 
progenitor cells are prevented from infiltrating the matrix, thereby providing the protein the 
opportunity to assist the osteogenic activity of the progenitor cells. In further compositions, 

30 proteins or other active ingredients of the invention may be combined with other agents 

beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question. 
These agents include various growth factors such as epidermal growth factor (EGF), platelet 
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derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-p), and 

insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
5 patients for such treatment with proteins or other active ingredients of the present invention. 
The dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration will be determined by the attending physician considering various factors which 
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site 
of damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue 

10 (e.g., bone), the patient's age, sex, and diet, the severity of any infection, time of 

administration and other clinical factors. The dosage may vary with the type of matrix used 
in the reconstitution and with inclusion of other proteins in the pharmaceutical composition. 
For example, the addition of other known growth factors, such as IGF I (insulin like growth 
factor I), to the final composition, may also effect the dosage. Progress can be monitored by 

1 5 periodic assessment of tissue/bone growth and/or repair, for example, X-rays, 
histomorphometric determinations and tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 

20 known methods for introduction of nucleic acid into a cell or organism (including, without 
limitation, in the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in 
the presence of proteins of the present invention in order to proliferate or to produce a 
desired effect on or activity in such cells. Treated cells can then be introduced in vivo for 
therapeutic purposes. 

25 

4.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve 
its intended purpose. More specifically, a therapeutically effective amount means an amount 
30 effective to prevent development of or to alleviate the existing symptoms of the subject 

being treated. Determination of the effective amount is well within the capability of those 
skilled in the art, especially in light of the detailed disclosure provided herein. For any 
compound used in the method of the invention, the therapeutically effective dose can be 
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estimated initially from appropriate in vitro assays. For example, a dose can be formulated 
in animal models to achieve a circulating concentration range that can be used to more 
accurately determine useful doses in humans. For example, a dose can be formulated in 
animal models to achieve a circulating concentration range that includes the IC50 as 
5 determined in cell culture (i.e. , the concentration of the test compound which achieves a 
half-maximal inhibition of the protein's biological activity). Such information can be used 
to more accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 

1 0 efficacy of such compounds can be determined by standard pharmaceutical procedures in 
cell cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% 
of the population) and the ED 50 (the dose therapeutically effective in 50% of the population). 
The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio between LD50 and ED 50 . Compounds which exhibit high therapeutic 

1 5 indices are preferred. The data obtained from these cell culture assays and animal studies 
can be used in formulating a range of dosage for use in human. The dosage of such 
compounds lies preferably within a range of circulating concentrations that include the ED 50 
with little or no toxicity. The dosage may vary within this range depending upon the dosage 
form employed and the route of administration utilized. The exact formulation, route of 

20 administration and dosage can be chosen by the individual physician in view of the patient's 
condition. See, e.g., Fingl et al., 1975, in "The Pharmacological Basis of Therapeutics", Ch. 
1 p.l. Dosage amount and interval may be adjusted individually to provide plasma levels of 
the active moiety which are sufficient to maintain the desired effects, or minimal effective 
concentration (MEC). The MEC will vary for each compound but can be estimated from in 

25 vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics 
and route of administration. However, HPLC assays or bioassays can be used to determine 

plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should be 
administered using a regimen which maintains plasma levels above the MEC for 10-90% of 
30 the time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 
related to plasma concentration. 
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An exemplary dosage regimen for polypeptides or other compositions of the 
invention will be in the range of about 0.01 jxg/kg to 100 mg/kg of body weight daily, with 
the preferred dose being about 0.1 \xg/kg to 25 mg/kg of patient body weight daily, varying 
in adults and children. Dosing may be once daily, or equivalent doses may be delivered at 
5 longer or shorter intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

10 4.12.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device which 
may contain one or more unit dosage forms containing the active ingredient. The pack may, 
for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser 
device may be accompanied by instructions for administration. Compositions comprising a 
1 5 compound of the invention formulated in a compatible pharmaceutical carrier may also be 
prepared, placed in an appropriate container, and labeled for treatment of an indicated 
condition. 

4.13 ANTIBODIES 

20 Also included in the invention are antibodies to proteins, or fragments of proteins of 

the invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
contain an antigen-binding site that specifically binds (immunoreacts with) an antigen. Such 
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 

25 Fab, F ab > and F (ab ')2 fragments, and an F ab expression library. In general, an antibody molecule 
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ 
from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as Igd, IgG 2 , and others. Furthermore, in humans, the light 

r 

chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
30 reference to all such classes, subclasses and types of human antibody species. 

An isolated related protein of the invention may be intended to serve as an antigen, or 
a portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for 
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polyclonal and monoclonal antibody preparation. The Ml-length protein can be used or, 
alternatively, the invention provides antigenic peptide fragments of the antigen for use as 
immunogens. An antigenic peptide fragment comprises at least 6 amino acid residues of the 
amino acid sequence of the full length protein, such as an amino acid sequence shown in 
5 SEQ ID NO: 972-1942, or 2710-3476, or Tables 3, 4A, 4B, or 5, and encompasses an 
epitope thereof such that an antibody raised against the peptide forms a specific immune 
complex with the full length protein or with any fragment that contains the epitope. 
Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 15 
amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 

10 Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 
located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a surface region of the protein, e.g., a hydrophilic region. A 
hydrophobicity analysis of the human related protein sequence will indicate which regions of 

15 a related protein are particularly hydrophilic and, therefore, are likely to encode surface 
residues useful for targeting antibody production. As a means for targeting antibody 
production, hydropathy plots showing regions of hydrophilicity and hydrophobicity may be 
generated by any method well known in the art, including, for example, the Kyte Doolittle or 
the Hopp Woods methods, either with or without Fourier transformation. See, e.g., Hopp and 

20 Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. Mol. 
Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
Antibodies that are specific for one or more domains within an antigenic protein, or 
derivatives, fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 

25 thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

The term "specific for" indicates that the variable regions of the antibodies of the 
invention recognize and bind polypeptides of the invention exclusively (z.e., able to 
distinguish the polypeptide of the invention from other similar polypeptides despite sequence 

30 identity, homology, or similarity found in the family of polypeptides), but may also interact 
with other proteins (for example, S. aureus protein A or other antibodies in ELISA 
techniques) through interactions with sequences outside the variable region of the antibodies, 
and in particular, in the constant region of the molecule. Screening assays to determine 
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binding specificity of an antibody of the invention are well known and routinely practiced 
in the art. For a comprehensive discussion of such assays, see Harlow et al. (Eds), 
Antibodies A Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring Harbor, NY 
(1988), Chapter 6. Antibodies that recognize and bind fragments of the polypeptides of the 
5 invention are also contemplated, provided that the antibodies are first and foremost specific 
for, as defined above, full-length polypeptides of the invention. As with antibodies that are 
specific for full length polypeptides of the invention, antibodies of the invention that 
recognize fragments are those which can distinguish polypeptides from the same family of 
polypeptides despite inherent sequence identity, homology, or similarity found in the family 
10 of proteins. 

Antibodies of the invention are useful for, for example, therapeutic purposes (by 
modulating activity of a polypeptide of the invention), diagnostic purposes to detect or 
quantitate a polypeptide of the invention, as well as purification of a polypeptide of the 
invention. Kits comprising an antibody of the invention for any of the purposes described 

15 herein are also comprehended, hi general, a kit of the invention also includes a control 
antigen for which the antibody is immunospecific. The invention further provides a 
hybridoma that produces an antibody according to the invention. Antibodies of the 
invention are useful for detection and/or purification of the polypeptides of the invention. 
Monoclonal antibodies binding to the protein of the invention may be useful 

20 diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting and 

25 preventing the metastatic spread of the cancerous cells, which may be mediated by the 
protein. 

The labeled antibodies of the present invention can be used for in vitro, in vivo, and 
in situ assays to identify cells or tissues in which a fragment of the polypeptide of interest is 
expressed. The antibodies may also be used directly in therapies or other diagnostics. The 
30 present invention further provides the above-described antibodies immobilized on a solid 
support. Examples of such solid supports include plastics such as polycarbonate, complex 

* 

carbohydrates such as agarose and Sepharose®, acrylic resins and such as polyacrylamide 
and latex beads. Techniques for coupling antibodies to such solid supports are well known 
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in the art (Weir, DM. et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby, W.D. et ah, Meth. 
Enzym. 34 Academic Press, N.Y. (1974)). The immobilized antibodies of the present 
invention can be used for in vitro, in vivo, and in situ assays as well as for immuno-affinity 
5 purification of the proteins of the present invention. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: 
A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, 
1 0 Cold Spring Harbor, NY, incorporated herein by reference). Some of these antibodies are 
discussed below. 

4.13.1 POLYCLONAL ANTIBODIES 

For the production of polyclonal antibodies, various suitable host animals (e.g., 

15 rabbit, goat, mouse or other mammal) may be immunized by one or more injections with the 
native protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 

20 to a second protein known to be immunogenic in the mammal being immunized. Examples 
of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, 
serum albumin, bovine thyro globulin, and soybean trypsin inhibitor. The preparation can 
further include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 

25 aluminum hydroxide), surface-active substances (e.g., lysolecithin, pluronic polyols, 

polyanions, peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as 
Bacille Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory 
agents. Additional examples of adjuvants that can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

30 The polyclonal antibody molecules directed against the immunogenic protein can be 

isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
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antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immuno affinity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. 
Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
5 (April 17, 2000), pp. 25-28). 



4.13.2 MONOCLONAL ANTIBODIES 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 
used herein, refers to a population of antibody molecules that contain only one molecular 

1 0 species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) 
of the monoclonal antibody are identical in all the molecules of the population. MAbs thus 
contain an antigen-binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

1 5 Monoclonal antibodies can be prepared using hybridoma methods, such as those 

described by Kohler and Milstein, Nature, 256, 495 (1975). In a hybridoma method, a 
mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies 
that will specifically bind to the immunizing agent. Alternatively, the lymphocytes can be 

20 immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof 
or a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells 
of human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 

25 line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 

(Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59- 
103). Immortalized cell lines are usually transformed mammalian cells, particularly 
myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell 
lines are employed. The hybridoma cells can be cultured in a suitable culture medium that 

30 preferably contains one or more substances that inhibit the growth or survival of the unfused, 
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
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typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high 
level expression of antibody by the selected antibody-producing cells, and are sensitive to a 
5 medium such as HAT medium. More preferred immortalized cell lines are murine myeloma 
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, 
San Diego, California and the American Type Culture Collection, Manassas, Virginia. 
Human myeloma and mouse-human heteromyeloma cell lines also have been described for 
the production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); 

10 Brodeur et al., Monoclonal Antibody Production Techniques and Applications, Marcel 
Dekker, Inc., New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed 
for the presence of monoclonal antibodies directed against the antigen. Preferably, the 
binding specificity of monoclonal antibodies produced by the hybridoma cells is determined 

15 by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 
the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107, 220 (1980). Preferably, 
antibodies having a high degree of specificity and a high binding affinity for the target 

20 antigen are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods. Suitable culture media for this 
purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 
medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

25 The monoclonal antibodies secreted by the subclones can be isolated or purified from 

the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 

30 those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of 
the invention can be readily isolated and sequenced using conventional procedures (e.g., by 
using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as 
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a preferred source of such DNA. Once isolated, the DNA can be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster 
ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, 
to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA 
5 also can be modified, for example, by substituting the coding sequence for human heavy and 
light chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the 
immunoglobulin coding sequence all or part of the coding sequence for a non- 
immunoglobulin polypeptide. Such a non-immuno globulin polypeptide can be substituted 
10 for the constant domains of an antibody of the invention, or can be substituted for the 

variable domains of one antigen-combining site of an antibody of the invention to create a 
chimeric bivalent antibody. 

4.13.3 HUMANIZED ANTIBODIES 

1 5 The antibodies directed against the protein antigens of the invention can further 

comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against 
the administered immunoglobulin. Humanized forms of antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 

20 F(ab')2 or other antigen-binding subsequences of antibodies) that are principally comprised 
of the sequence of a human immunoglobulin, and contain minimal sequence derived from a 
non-human immunoglobulin. Humanization can be performed following the method of 
Winter and co-workers (Jones et al., Nature, 321, 522-525 (1986); Riechmann et al., Nature, 
332, 323-327 (1988); Verhoeyen et al., Science, 239, 1534-1536 (1988)), by substituting 

25 rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. (See 
also U.S. Patent No. 5,225,539). In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
can also comprise residues that are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, the humanized antibody will comprise 

30 substantially all of at least one, and typically two, variable domains, in which all or 

substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework regions are those of a human immunoglobulin 
consensus sequence. The humanized antibody optimally also will comprise at least a portion 
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of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin 
(Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol., 2, 593-596 
(1992)). 

5 4.13.4 HUMAN ANTIBODIES 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from 
human genes. Such antibodies are termed "human antibodies", or "fully human antibodies" 
herein. Human monoclonal antibodies can be prepared by the trioma technique; the human 

10 B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV 
hybiidoma technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 
monoclonal antibodies may be utilized in the practice of the present invention and may be 
produced by using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80, 

1 5 2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et 
al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227, 381 (1991); 
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 

20 introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 

25 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779- 
783 (1992)); Lonberg et al. (Nature 368, 856-859 (1994)); Morrison (Nature 368, 812-13 

(1994) ); Fishwild et al, (Nature Biotechnology 14, 845-51 (1996)); Neuberger (Nature 
Biotechnology 14, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13, 65-93 

(1995) ). 

30 Human antibodies may additionally be produced using transgenic nonhuman animals 

« 

that are modified so as to produce fully human antibodies rather than the animal's 
endogenous antibodies in response to challenge by an antigen. (See PCT publication 
WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains 
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in the nonhuman host have been incapacitated, and active loci encoding human heavy and 
light chain immunoglobulins are inserted into the host's genome. The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the requisite 
human DNA segments. An animal which provides all the desired modifications is then 
5 obtained as progeny by crossbreeding intermediate transgenic animals containing fewer than 
the full complement of the modifications. The preferred embodiment of such a nonhuman 
animal is a mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 
96/33735 and WO 96/34096. This animal produces B cells that secrete fully human 
immunoglobulins. The antibodies can be obtained directly from the animal after 

10 immunization with an immunogen of interest, as, for example, a preparation of a polyclonal 
antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
immunoglobulins with human variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 

15 example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 
Patent No. 5,939,598. It can be obtained by a method including deleting the J segment genes 
from at least one endogenous heavy chain locus in an embryonic stem cell to prevent 

20 rearrangement of the locus and to prevent formation of a transcript of a rearranged 
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector 
containing a gene encoding a selectable marker; and producing from the embryonic stem cell 
a transgenic mouse whose somatic and germ cells contain the gene encoding the selectable 
marker. 

25 A method for producing an antibody of interest, such as a human antibody, is 

disclosed in U.S. Patent No. 5,916,771 . It includes introducing an expression vector that 
contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 
culture, introducing an expression vector containing a nucleotide sequence encoding a light 
chain into another mammalian host cell, and fusing the two cells to form a hybrid cell. The 

30 hybrid cell expresses an antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically 
relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
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binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 

4.13.5 FAB FRAGMENTS AND SINGLE CHAIN ANTIBODIES 

5 According to the invention, techniques can be adapted for the production of 

single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent 
No. 4,946,778). In addition, methods can be adapted for the construction of F a b expression 
libraries (see e.g., Huse, et aL, 1989 Science 246, 1275-1281) to allow rapid and effective 
identification of monoclonal F ab fragments with the desired specificity for a protein or 

10 derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the 

idiotypes to a protein antigen may be produced by techniques known in the art including, but 
not limited to: (i) an F (ab ')2 fragment produced by pepsin digestion of an antibody molecule; 
(ii) an F ab fragment generated by reducing the disulfide bridges of an F (ab ')2 fragment; (iii) an 
F ab fragment generated by the treatment of the antibody molecule with papain and a reducing 

15 agent and (iv) F v fragments. 

4.13.6 BISPECIFIC ANTIBODIES 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
that have binding specificities for at least two different antigens. In the present case, one of 
20 the binding specificities is for an antigenic protein of the invention. The second binding 
target is any other antigen, and advantageously is a cell-surface protein or receptor or 
receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 

25 immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305, 537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
produce a potential mixture often different antibody molecules, of which only one has the 
correct bispecific structure. The purification of the correct molecule is usually accomplished 

30 by affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, 
published 13 May 1993, and in Traunecker et aL, 1991 EMBOJ., 10, 3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
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preferably is with an immunoglobulin heavy-chain constant domain, comprising at least 
part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain 
constant region (CHI) containing the site necessary for light-chain binding present in at least 
one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, 
5 the immunoglobulin light chain, are inserted into separate expression vectors, and are co- 
transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al., Methods in Enzymology, 121, 210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a 
pair of antibody molecules can be engineered to maximize the percentage of heterodimers 
1 0 that are recovered from recombinant cell culture. The preferred interface comprises at least 
a part of the CH3 region of an antibody constant domain. In this method, one or more small 
amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or 
similar size to the large side chain(s) are created on the interface of the second antibody 
15 molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or 

threonine). This provides a mechanism for increasing the yield of the heterodimer over other 
unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full-length antibodies or antibody fragments 
(e.g. F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from 
20 antibody fragments have been described in the literature. For example, bispecific antibodies 
can be prepared using chemical linkage. Brennan et al., Science 229, 81 (1985) describe a 
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 
fragments. These fragments are reduced in the presence of the dithiol complexing agent 
sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. 
25 The Fab' fragments generated are then converted to thionitrobenzoate (TNB) derivatives. 
One of the Fab 5 -TNB derivatives is then reconverted to the Fab '-thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab' -TNB 
derivative to form the bispecific antibody. The bispecific antibodies produced can be used 
as agents for the selective immobilization of enzymes. 
30 Additionally, Fab' fragments can be directly recovered from E. coli and chemically 

coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med,. 175, 217-225 (1992) 
describe the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each 
Fab' fragment was separately secreted from E. coli and subjected to directed chemical 
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coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as 
trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 
Various techniques for making and isolating bispecific antibody fragments directly 
5 from recombinant cell culture have also been described. For example, bispecific antibodies 
have been produced using leucine zippers. Kostelny et al., J. Immunol. 148(5), 1547-1553 
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 
portions of two different antibodies by gene fusion. The antibody homodimers were reduced 
at the hinge region to form monomers and then re-oxidized to form the antibody 

10 heterodimers. This method can also be utilized for the production of antibody homodimers. 
The "diabody" technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90, 
6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody 
fragments. The fragments comprise a heavy-chain variable domain (V H ) connected to a 
light-chain variable domain (V L ) by a linker which is too short to allow pairing between the 

1 5 two domains on the same chain. Accordingly, the V H and V L domains of one fragment are 
forced to pair with the complementary V L and V H domains of another fragment, thereby 
forming two antigen-binding sites. Another strategy for making bispecific antibody 
fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, Gruber et 
al., J. Immunol. 152, 5368 (1994). 

20 Antibodies with more than two valencies are contemplated. For example, trispecific 

antibodies can be prepared. Tutt et al., J. Immunol. 147, 60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm 
of an immunoglobulin molecule can be combined with an arm which binds to a triggering 

25 molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CDS, CD28, or B7), 
or Fc receptors for IgG (Fc-yR), such as FC7RI (CD64), FC7RII (CD32) and Fc-yRHI (CD 16) 
so as to focus cellular defense mechanisms to the cell expressing the particular antigen. 
Bispecific antibodies can also be used to direct cytotoxic agents to cells which express a 
particular antigen. These antibodies possess an antigen-binding arm and an arm which binds 

30 a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, DOT A, or TETA. 

Another bispecific antibody of interest binds the protein antigen described herein and further 
binds tissue factor (TF). 
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4.13.7 HETEROCONJUGATE ANTIBODIES 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
antibodies have, for example, been proposed to target immune system cells to unwanted cells 
5 (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using 
known methods in synthetic protein chemistry, including those involving crosslinking 
agents. For example, immunotoxins can be constructed using a disulfide exchange reaction 
or by forming a thioether bond. Examples of suitable reagents for this purpose include 
10 iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. 
Patent No. 4,676,980. 

4.13.8 EFFECTOR FUNCTION ENGINEERING 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing 
interchain disulfide bond formation in this region. The homodimeric antibody thus 
generated can have improved internalization capability and/or increased complement- 
mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et 
al, J. Exp Med., 176, 1 191-1 195 (1992) and Shopes, J. Immunol., 148, 2918-2922 (1992). 
Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using 
heterobiflinctional cross-linkers as described in Wolff et al. Cancer Research, 53, 2560- 
2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and can 
thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al., 
Anti-Cancer Drug Design, 3, 219-230 (1989). 

4.13.9 IMMUNOCONJUGATES 

The invention also pertains to immunoconjugates coxnprising an antibody conjugated 
to a cytotoxic agent such as a chemo therapeutic agent, toxin (e.g., an enzymatically active 
30 toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive 
isotope (i.e., a radio conjugate). 

Chemo therapeutic agents useful in the generation of such immunoconjugates have 
been described above. Enzymatically active toxins and fragments thereof that can be used 
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include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A 
chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, 
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 
(PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
5 officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the 

tricothecenes. A variety of radionuclides are available for the production of radioconjugated 
antibodies. Examples include 212 Bi, 131 I, 131 Li, 90 Y, and l86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate 

10 (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl 
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediarnine), bis- 
diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates 
(such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro- 

15 2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
Vitetta et al., Science, 238: 1098 (1987). Carbon- 14-labeled l-isothiocyanatobenzyl-3- 
methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for 
conjugation of radionucleotide to the antibody. See W094/1 1026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 

20 streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 



4.14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention 
can be recorded on computer readable media. As used herein, "computer readable media" 
refers to any medium which can be read and accessed directly by a computer. Such media 
30 include, but are not limited to: magnetic storage media, such as floppy discs, hard disc 
storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how any of the 
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presently known computer readable mediums can be used to create a manufacture 
comprising computer readable medium having recorded thereon a nucleotide sequence of the 
present invention. As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently known 
5 methods for recording information on computer readable medium to generate manufactures 
comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means 

10 chosen to access the stored information. In addition, a variety of data processor programs 
and formats can be used to store the nucleotide sequence information of the present 
invention on computer readable medium. The sequence information can be represented in a 
word processing text file, formatted in commercially-available software such as WordPerfect 
and Microsoft Word, or represented in the form of an ASCII file, stored in a database 

15 application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any 
number of data processor structuring formats {e.g. text file or database) in order to obtain 
computer readable medium having recorded thereon the nucleotide sequence information of 

the present invention. 

By providing any of the nucleotide sequences SEQ ID NO: 1-971, or 1943-2709 or a 

20 representative fragment thereof; or a nucleotide sequence at least 95% identical to any of the 
nucleotide sequences of SEQ ID NO: 1-971, or 1943-2709 in computer readable form, a 
skilled artisan can routinely access the sequence information for a variety of purposes. 
Computer software is publicly available which allows a skilled artisan to access sequence 
information provided in a computer readable medium. The examples which follow 

25 demonstrate how software which implements the BLAST (Altschul et al., J. Mol. BioL 

215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search 
algorithms on a Sybase system is used to identify open reading frames (ORFs) within a 
nucleic acid sequence. Such ORFs may be protein-encoding fragments and may be useful in 
producing commercially important proteins such as enzymes used in fermentation reactions 

30 and in the production of commercially useful metabolites. 

As used herein, M a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the 
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present invention comprises a central processing unit (CPU), input means, output means, 
and data storage means. A skilled artisan can readily appreciate that any one of the currently 
available computer-based systems are suitable for use in the present invention. As stated 
above, the computer-based systems of the present invention comprise a data storage means 
5 having stored therein a nucleotide sequence of the present invention and the necessary 

hardware means and software means for supporting and implementing a search means. As 
used herein, "data storage means" refers to memory which can store nucleotide sequence 
information of the present invention, or a memory access means which can access 
manufactures having recorded thereon the nucleotide sequence information of the present 
10 invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target structural 
motif with the sequence information stored within the data storage means. Search means are 
used to identify fragments or regions of a known sequence which match a particular target 

1 5 sequence or target motif. A variety of known algorithms are disclosed publicly and a variety 
of commercially available software for conducting search means are and can be used in the 
computer-based systems of the present invention. Examples of such software includes, but 
is not limited to, Smith- Waterman, MacPattern (EMBL), BLASTN and BLASTA 
(NP OLYPEPTEDEIA) . A skilled artisan can readily recognize that any one of the available 

20 algorithms or implementing software packages for conducting homology searches can be 
adapted for use in the present computer-based systems. As used herein, a "target sequence" 
can be any nucleic acid or amino acid sequence of six or more nucleotides or two or more 
amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the 
less likely a target sequence will be present as a random occurrence in the database. The 

25 most preferred sequence length of a target sequence is from about 10 to 300 amino acids, 
more preferably from about 30 to 100 nucleotide residues. However, it is well recognized 
that searches for commercially important fragments, such as sequence fragments involved in 
gene expression and protein processing, may be of shorter length. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 

30 selected sequence or combination of sequences in which the sequence(s) are chosen based on 
a three-dimensional configuration which is formed upon the folding of the target motif. 
There are a variety of target motifs known in the art. Protein target motifs include, but are 
not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs include, 
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but are not limited to, promoter sequences, hairpin structures and inducible expression 
elements (protein binding sequences). 



4.15 TRIPLE HELIX FORMATION 

5 In addition, the fragments of the present invention, as broadly described, can be used 

to control gene expression through triple helix formation or antisense DNA or RNA, both of 
which methods are based on the binding of a polynucleotide sequence to DNA or RNA. 
Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and 
are designed to be complementary to a region of the gene involved in transcription (triple 

10 helix-see Lee et al., Nucl. Acids Res. 6, 3073 (1979); Cooney et al., Science 15241, 456 
(1988); and Dervan et al., Science 251, 1360 (1991)) or to the mRNA itself (antisense- 
Olmno, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of 
Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization 

1 5 blocks translation of an mRNA molecule into polypeptide. Both techniques have been 

demonstrated to be effective in model systems. Information contained in the sequences of 
the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide. 

20 4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression 
of one of the ORFs of the present invention, or homolog thereof, in a test sample, using a 
nucleic acid probe or antibodies of the present invention, optionally conjugated or otherwise 
associated with a suitable label. 

25 In general, methods for detecting a polynucleotide of the invention can comprise 

contacting a sample with a compound that binds to and forms a complex with the 
polynucleotide for a period sufficient to form the complex, and detecting the complex, so 
that if a complex is detected, a polynucleotide of the invention is detected in the sample. 
Such methods can also comprise contacting a sample under stringent hybridization 

30 conditions with nucleic acid primers that anneal to a polynucleotide of the invention under 
such conditions, and amplifying annealed polynucleotides, so that if a polynucleotide is 
amplified, a polynucleotide of the invention is detected in the sample. 
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In general, methods for detecting a polypeptide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
polypeptide for a period sufficient to form the complex, and detecting the complex, so that if 
a complex is detected, a polypeptide of the invention is detected in the sample. 
5 In detail, such methods comprise incubating a test sample with one or more of the 

antibodies or one or more of the nucleic acid probes of the present invention and assaying 
for binding of the nucleic acid probes or antibodies to components within the test sample. 

Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the detection methods 

10 employed, and the type and nature of the nucleic acid probe or antibody used in the assay. 
One skilled in the art will recognize that any one of the commonly available hybridization, 
amplification or immunological assay formats can readily be adapted to employ the nucleic 
acid probes or antibodies of the present invention. Examples of such assays can be found in 
Chard, T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science 

15 Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in 

hnmunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 
(1985); Tijssen, P., Practice and Theory of immunoassays: Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The 
Netherlands (1985). The test samples of the present invention include cells, protein or 

20 membrane extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or 
urine. The test sample used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample to 
be assayed. Methods for preparing protein extracts or membrane extracts of cells are well 
known in the art and can be readily be adapted in order to obtain a sample which is 

25 compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the 
invention provides a compartment kit to receive, in close confinement, one or more 
containers which comprises: (a) a first container comprising one of the probes or antibodies 

30 of the present invention; and (b) one or more other containers comprising one or more of the 
following: wash reagents, reagents capable of detecting presence of a bound probe or 
antibody. 
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In detail, a compartment kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers or 
strips of plastic or paper. Such containers allows one to efficiently transfer reagents from 
one compartment to another compartment such that the samples and reagents are not 
5 cross-contaminated, and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the antibodies used 
in the assay, containers which contain wash reagents (such as phosphate buffered saline, 
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound 

10 antibody or probe. Types of detection reagents include labeled nucleic acid probes, labeled 
secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, 
or antibody binding reagents which are capable of reacting with the labeled antibody. One 
skilled in the art will readily recognize that the disclosed probes and antibodies of the present 
invention can be readily incorporated into one of the established kit formats which are well 

1 5 known in the art. 



4.17 MEDICAL IMAGING 

The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
20 invention is involved in the immune response, for imaging sites of inflammation or 
infection).' See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve 
chemical attachment of a labeling or imaging agent, administration of the labeled 
polypeptide to a subject in a pharmaceutic ally acceptable carrier, and imaging the labeled 
polypeptide in vivo at the target site. 

25 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present 
invention further provides methods of obtaining and identifying agents which bind to a 
polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set forth 
30 in SEQ ID NO: 1-971, or 1943-2709, or bind to a specific domain of the polypeptide 
encoded by the nucleic acid. In detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the 
present invention, or nucleic acid of the invention; and 
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(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 
polynucleotide of the invention can comprise contacting a compound with a polynucleotide 
of the invention for a time sufficient to form a polynucleotide/compound complex, and 
5 detecting the complex, so that if a polynucleotide/compound complex is detected, a 
compound that binds to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to 
a polypeptide of the invention can comprise contacting a compound with a polypeptide of 
the invention for a time sufficient to form a polypeptide/compound complex, and detecting 
10 the complex, so that if a polypeptide/compound complex is detected, a compound that binds 
to a polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention can 
also comprise contacting a compound with a polypeptide of the invention in a cell for a time 
sufficient to form a polypeptide/compound complex, wherein the complex drives expression 
15 of a receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
sequence expression, so that if a polypeptide/compound complex is detected, a compound 
that binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 
20 activity observed in the absence of the compound). Alternatively, compounds identified via 
such methods can include compounds which modulate the expression of a polynucleotide of 
the invention (that is, increase or decrease expression relative to expression levels observed 
in the absence of the compound). Compounds, such as compounds identified via the 
methods of the invention, can be tested using standard assays well known to those of skill in 
25 the art for their ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
selected and screened at random or rationally selected or designed using protein modeling 
techniques. 

30 For random screening, agents such as peptides, carbohydrates, pharmaceutical agents 

and the like are selected at random and are assayed for their ability to bind to the protein 
encoded by the ORP of the present invention. Alternatively, agents may be rationally 
selected or designed. As used herein, an agent is said to be "rationally selected or designed" 
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when the agent is chosen based on the configuration of the particular protein. For example, 
one skilled in the art can readily adapt currently available procedures to generate peptides, 
pharmaceutical agents and the like, capable of binding to a specific peptide sequence, in 
order to generate rationally designed antipeptide peptides, for example see Hurby et al., 
5 Application of Synthetic Peptides: Antisense Peptides," In Synthetic Peptides, A User's 
Guide, W.H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 
28:9230-8 (1989), or pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as broadly 
described, can be used to control gene expression through binding to one of the ORFs or 

10 EMFs of the present invention. As described above, such agents can be randomly screened 
or rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating the expression of either a single 
ORF or multiple ORFs which rely on the same EMF for expression control. One class of 
DNA binding agents are agents which contain base residues which hybridize or form a triple 

1 5 helix formation by binding to DNA or RNA. Such agents can be based on the classic 

phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric 
derivatives which have base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - 

20 see Lee et al., Nucl. Acids Res. 6, 3073 (1979); Cooney et al., Science 241, 456 (1988); and 
Dervan et al., Science 251, 1360 (1991)) or to the mRNA itself (antisense-Okano, J. 
Neurochem. 56, 560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally results in 
a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks 

25 translation of an mRNA molecule into polypeptide. Both techniques have been 

demonstrated to be effective in model systems. Information contained in the sequences of 
the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present invention 

30 can be used as a diagnostic agent. Agents which bind to a protein encoded by one of the 
ORFs of the present invention can be formulated using known techniques to generate a 
pharmaceutical composition. 
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4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic 
acid hybridization probes capable of hybridizing with naturally occurring nucleotide 
sequences. The hybridization probes of the subject invention may be derived from any of 
5 the nucleotide sequences SEQ ID NO: 1-971, or 1943-2709. Because the corresponding 
gene is only expressed in a limited number of tissues, a hybridization probe derived from 
any of the nucleotide sequences SEQ ID NO: 1-971, or 1943-2709 can be used as an 
indicator of the presence of RNA of cell type of such a tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
10 hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 

additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used 
in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. 
The probe will comprise a discrete nucleotide sequence for the detection of identical 
sequences or a degenerate pool of possible sequences for identification of closely related 

15 genomic sequences. 

Other means for producing specific hybridization probes for nucleic acids include the 
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such 
vectors are known in the art and are commercially available and may be used to synthesize 
RNA probes in vitro by means of the addition of the appropriate RNA polymerase as T7 or 

20 SP6 RNA polymerase and the appropriate radioactively labeled nucleotides. The nucleotide 
sequences may be used to construct hybridization probes for mapping their respective 
genomic sequences. The nucleotide sequence provided herein may be mapped to a 
chromosome or specific regions of a chromosome using well-known genetic and/or 
chromosomal mapping techniques. These techniques include in situ hybridization, linkage 

25 analysis against known chromosomal markers, hybridization screening with libraries or 

flow-sorted chromosomal preparations specific to known chromosomes, and the like. The 
technique of fluorescent in situ hybridization of chromosome spreads has been described, 
among other places, in Verma et al (1988) Human Chromosomes: A Manual of Basic 
Techniques, Pergamon Press, New York NY. 

30 Fluorescent in situ hybridization of chromosomal preparations and other physical 

chromosome mapping techniques may be correlated with additional genetic map data. 
Examples of genetic map data can be found in the 1994 Genome Issue of Science 
(265 : 1 98 1 f) . Correlation between the location of a nucleic acid on a physical chromosomal 
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map and a specific disease (or predisposition to a specific disease) may help delimit the 
region of DNA associated with that genetic disease. The nucleotide sequences of the subject 
invention may be used to detect differences in gene sequences between normal, carrier or 
affected individuals. 

5 4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for 
example, directly synthesizing the oligonucleotide by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those 

10 of skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy 
is to precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can 
be achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6), 1469- 
72); using UV light (Nagata et aL, 1985; Dahlen et aL, 1987; Morrissey & Collins, (1989) Mol. 
Cell Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et aL, 1988; 

1 5 1989); all references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al. (1994) Proc. Natl. Acad. Sci. USA 91(8), 
3072-6, describe the use of biotinylated probes, although these are duplex probes, that are 
immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be 

20 purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating 
any surface with streptavidin. Biotinylated probes may be purchased from various sources, 
such as, e.g., Operon Technologies (Alameda, CA). 

Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. 
Nunc Laboratories have developed a method by which DNA can be covalently bound to the 

25 microwell surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with 
secondary amino groups (>NH) that serve as bridgeheads for further covalent coupling. 
CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules may be bound 
to CovaLink exclusively at the 5 -end by a phosphoramidate bond, allowing immobilization of 
more than 1 pmol of DNA (Rasmussen et al, (1991) Anal. Biochem. 198(1) 138-42). 

30 The use of CovaLink NH strips for covalent binding of DNA molecules at the 5 -end 

has been described (Rasmussen et al., (1991). In this technology, a phosphoramidate bond is 
employed (Chu et al., (1983) Nucleic Acids Res. 11(8) 6513-29). This is beneficial as 
immobilization using only a single covalent bond is preferred. The phosphoramidate bond joins 



WO 03/029271 PCT/US02/30474 

107 

the DNA to the CovaLink NH secondary amino groups that are positioned at the end of spacer 
arms covalently grafted onto the polystyrene surface through a 2 nm long spacer arm. To link 
an oligonucleotide to CovaLink NH via an phosphoramidate bond, the oligonucleotide terminus 
must have a 5 f -end phosphate group. It is, perhaps, even possible for biotin to be covalently 
5 bound to CovaLink and then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/pl) and 
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1M1- 
methylimidazole, pH 7.0 (1-Melm 7 ), is then added to a final concentration of 10 mM 1-Melm 7 . 
A ss DNA solution is then dispensed into CovaLink NH strips (75 ]Lil/well) standing on ice. 
10 Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), 

dissolved in 10 mM l-Melmy, is made fresh and 25 jlxI added per well. The strips are incubated 
for 5 hours at 50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; 
first the wells are washed 3 times, then they are soaked with washing solution for 5 min., and 
finally they are washed 3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS 

15 heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is 
that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated 
herein by reference. This method of preparing an oligonucleotide bound to a support involves 
attaching a nucleoside 3 '-reagent through the phosphate group by a covalent phosphodiester link 

20 to aliphatic hydroxyl groups carried by the support. The oligonucleotide is then synthesized on 
the supported nucleoside and protecting groups removed from the synthetic oligonucleotide 
chain under standard conditions that do not cleave the oligonucleotide from the support. 
Suitable reagents include nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 

25 arrays may be employed. For example, addressable laser-activated photodeprotection may be 
employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described 
by Fodor et al (1991) Science 25 1(4995), 767-73, incorporated herein by reference. Probes 
may also be immobilized on nylon supports as described by Van Ness et al (1 991) Nucleic 
Acids Res., 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) 

30 Anal. Biochem. 1 69(1), 104-8; all references being specifically incorporated herein. 

To link an oligonucleotide to a nylon support, as described by Van Ness et al (1991), 
requires activation of the nylon surface via alkylation and selective activation of the 5-amine of 
oligonucleotides with cyanuric chloride. 
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One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et al, (1994) Proc. Nat'l. Acad. Sci., USA 91(11), 
5022-6, incorporated herein by reference). These authors used current photolithographic 
techniques to generate arrays of immobilized oligonucleotide probes (DNA chips). These 
5 methods, in which light is used to direct the synthesis of oligonucleotide probes in high-density, 
miniaturized arrays, utilize photolabile S'-protected N-acyl-deoxynucleoside phosphoramidites, 
surface linker chemistry and versatile combinatorial synthesis strategies. A matrix of 256 
spatially defined oligonucleotide probes may be generated in this manner. 

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

10 The nucleic acids may be obtained from any appropriate source, such as cDNAs, 

genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC 

inserts, and RNA, including mRNA without any amplification steps. For example, Sambrook 

et al (1989) describes three protocols for the isolation of high molecular weight DNA from 

mammalian cells (p. 9.14-9.23). 
1 5 DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or 

prepared directly from genomic DNA or cDNA by PCR or other amplification methods. 

Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA 

samples may be prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of 
20 skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 of 

Sambrook et al (1989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1990) 

Nucleic Acids Res. 18(24), 7455-6, incorporated herein by reference). In this method, DNA 

samples are passed through a small French pressure cell at a variety of low to intermediate 
25 pressures. A lever device allows controlled application of low to intermediate pressures to the 

cell. The results of these studies indicate that low-pressure shearing is a useful alternative to 

sonic and enzymatic DNA fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the 

two base recognition endonuclease, CVzJI, described by Fitzgerald et al (1992) Nucleic Acids 
30 Res. 20(14) 3753-62. These authors described an approach for the rapid fragmentation and 

fractionation of DNA into particular sizes that they contemplated to be suitable for shotgun 

cloning and sequencing. 
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The restriction endonuclease Cvi JI normally cleaves the recognition sequence PuGCPy 
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the 
specificity of this enzyme (CvzJI**), yield a quasi-random distribution of DNA fragments form 
the small molecule pUC19 (2688 base pairs). Fitzgerald et al (1992) quantitatively evaluated 
5 the randomness of this fragmentation strategy, using a CWJI** digest of pUC19 that was size 
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z 
minus Ml 3 cloning vector. Sequence analysis of 76 clones showed that CvzJI** restricts 
pyGCPy and PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated 
at a rate consistent with random fragmentation. 

10 As reported in the literature, advantages of this approach compared to sonication and 

agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 |j,g instead of 
2-5 jag); and fewer steps are involved (no preligation, end repair, chemical extraction, or 
agarose gel electrophoresis and elution are needed). 

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, 

15 it is important to denature the DNA to give single stranded pieces available for hybridization. 
This is achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is 
then cooled quickly to 2°C to prevent renaturation of the DNA fragments before they are 
contacted with the chip. Phosphate groups must also be removed from genomic DNA by 
methods known in the art. 

20 4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon 
membrane. Spotting may be performed by using arrays of metal pins (the positions of which 
correspond to an array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a 
DNA solution to a nylon membrane. By offset printing, a density of dots higher than the density 

25 of the wells is achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the 
type of label used. By avoiding spotting in some preselected number of rows and columns, 
separate subsets (subarrays) may be formed. Samples in one subarray may be the same genomic 
segment of DNA (or the same gene) from different individuals, or maybe different, overlapped 
genomic clones. Each of the subarrays may represent replica spotting of the same samples. In 

30 one example, a selected gene segment may be amplified from 64 patients. For each patient, the 
amplified gene segment may be in one 96-well plate (all 96 wells containing the same sample). 
A plate for each of the 64 patients is prepared. By using a 96-pin device, all samples may be 
spotted on one 8 x 12 cm membrane. Subarrays may contain 64 samples, one from each patient. 
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Where the 96 subarrays are identical, the dot span may be 1 mm 2 and there may be a 1 mm 
space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, 
Illinois) which may be partitioned by physical spacers e.g. a plastic grid molded over the 
5 membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell 
plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by exposure 
to flat phosphor-storage screens or x-ray films. 

The present invention is illustrated in the following examples. Upon consideration of 
the present disclosure, one of skill in the art will appreciate that many other embodiments and 
10 variations may be made in the scope of the present invention. Accordingly, it is intended that 
the broader aspects of the present invention not be limited to the disclosure of the following 
examples. The present invention is not to be limited in scope by the exemplified embodiments 
which are intended as illustrations of single aspects of the invention, and compositions and 
methods which are functionally equivalent are within the scope of the invention. Indeed, 
1 5 numerous modifications and variations in the practice of the invention are expected to occur to 
those skilled in the art upon consideration of the present preferred embodiments. Consequently, 
the only limitations which should be placed upon the scope of the invention are those which 
appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated 
20 by reference in their entirety. 

5.0 EXAMPLES 

* 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from 
25 various human tissues and in some cases isolated from a genomic library derived from human 
chromosome using standard PGR, SBH sequence signature analysis and Sanger sequencing 
techniques. The inserts of the library were amplified with PGR using primers specific for the 
vector sequences which flank the inserts. Clones from cDNA libraries were spotted on nylon 
membrane filters and screened with oligonucleotide probes (e.g., 7-mers) to obtain signature 
30 sequences. The clones were clustered into groups of similar or identical sequences. 
Representative clones were selected for sequencing. 
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In some cases, the 5' sequence of the amplified inserts was then deduced using a 
typical Sanger sequencing protocol. PCR products were purified and subjected to fluorescent 
dye terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied 
Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. 

5 5.2 EXAMPLE 2 

Assemblage of Novel Nucleic Acids 

The contigs or nucleic acids of the present invention, designated as SEQ ID NO: 1943- 
2709 were assembled using an EST sequence as a seed. Then a recursive algorithm was used to 
extend the seed EST into an extended assemblage, by pulling additional sequences from 

10 different databases (i.e., Hyseq's database containing EST sequences, dbEST, gb pri, and 
UniGene, and exons from public domain genomic sequences predicated by GenScan) that 
belong to this assemblage. The algorithm terminated when there were no additional sequences 
from the above databases that would extend the assemblage. Further, inclusion of component 
sequences into the assemblage was based on a BLASTN hit to the extending assemblage with 

1 5 BLAST score greater than 300 and percent identity greater than 95%. 

The novel predicted polypeptides (including proteins), SEQ ID NO: 2710-3476, 
encoded by the novel polynucleotides (SEQ ID NO: 1943-2709) of the present invention, and 
their corresponding translation start and stop nucleotide locations to each of SEQ ID NO: 1943- 
2709 were obtained using one of two methods. Polypeptides were obtained by using a 

20 software program called FASTY (available from http ://fasta.bioch. virginia,edu) which selects a 
polypeptide based on a comparison of the translated novel polynucleotide to known 
polynucleotides (W.R. Pearson, Methods in Enzymology, 183:63-98 (1990), herein 
incorporated by reference). Alternatively, polypeptides were obtained by using a software 
program called GenScan for human/vertebrate sequences (available from Stanford University, 

25 Office of Technology Licensing) that predicts the polypeptide based on a probabilistic model of 
gene structure/compositional properties (C. Burge and S. Karlin, J. Mol. Biol., 268:78-94 
(1997), incorporated herein by reference). Method C refers to a polypeptide obtained by using 
a Hyseq proprietary software program that translates the novel polynucleotide and its 
complementary strand into six possible amino acid sequences (forward and reverse frames) and 

30 chooses the polypeptide with the longest open reading frame. 



5.3 EXAMPLE 3 
Novel Nucleic Acids 
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The novel nucleic acids of the present invention were assembled from sequences that 
were obtained from a cDNA library by methods described in Example 1 above, and in some 
cases sequences obtained from one or more public databases. The nucleic acids were 
assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend the 
5 seed EST into an extended assemblage, by pulling additional sequences from different 

databases (Hyseq's database containing EST sequences, dbEST, gb pri, and UniGene) that 
belong to this assemblage. The algorithm terminated when there was no additional sequences 
from the above databases that would extend the assemblage. Inclusion of component sequences 
into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST 

1 0 score greater than 300 and percent identity greater than 95%. 

Using PHRAP (Univ. of Washington) or CAP4 (Paracel), a full-length gene cDNA 
sequence and its corresponding protein sequence were generated from the assemblage. Any 
frame shifts and incorrect stop codons were corrected by hand editing. During editing, the 
sequences were checked using FASTY and/or BLAST against Genebank (i.e., dbEST, gb pri, 

1 5 UniGene, and Genpept) and the Geneseq (Derwent). Other computer programs which may 

have been used in the editing process were phredPhrap and Consed (University of Washington) 
and ed-ready, ed-ext and cg-zip-2 (Hyseq, Inc.). The full-length nucleotide and amino acid 
sequences, including splice variants resulting from these procedures are shown in the Sequence 
Listing as SEQ ID NO: 1-1942. 

20 Table 1 shows the various tissue sources of SEQ ID NO: 1-971 . 

The results showing homologues for SEQ ID NO: 972-1942 from Genpept 124 are 
shown in Table 2A. The results showing homologues for SEQ ID NO: 972-1942 from 
Genpept 129 are shown in Table 2B. 

Using eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J. 

25 Comp. Biol., Vol. 6, 219-235 (1999), http://motif.stanford.edu/ematrix-search/ herein 
incorporated by reference), all the polypeptide sequences were examined to determine 
whether they had identifiable signature regions. Scoring matrices of the eMatrix software 
package are derived from the BLOCKS, PRINTS, PFAM, PRODOM, and DOMO 
databases. Table 3 shows the accession number of the homologous eMatrix signature found 

30 in the indicated polypeptide sequence, its description, and the results obtained which include 
accession number subtype; raw score; p-value; and the position of signature in amino acid 
sequence. * 
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Using the Pfam software program (Sonnhammer et al., Nucleic Acids Res., Vol. 
26(1) pp. 320-322 (1998) herein incorporated by reference) all the polypeptide sequences 
were examined for domains with homology to certain peptide domains. Table 4A shows the 
name of the Pfam model found, the description, the e-value and the Pfam score for the 
5 identified model within the sequence as described in United States priority application serial 
number 60/324,631, filed September 24, 2001, herein incorporated by reference in its 
entirety. Table 4B shows the name of the Pfam model found, the description, the e-value 
and the Pfam score for the identified model within the sequence using Pfam version 7.2. 
Further description of the Pfam models can be found at http ://pfam. wustl. edu/ . 

10 The GeneAtlas™ software package (Molecular Simulations Inc. (MSI), San Diego, 

CA) was used to predict the three-dimensional structure models for the polypeptides 
encoded by SEQ ID NO: 1-971 (i.e. SEQ ID NO: 972-1942). Models were generated by (1) 
PSI-BLAST which is a multiple alignment sequence profile-based searching developed by 
Altschul et al, (Nucl. Acids. Res. 25, 3389-3408 (1997)), (2) High Throughput Modeling 

15 (HTM) (Molecular Simulations Inc. (MSI) San Diego, CA,) which is an automated sequence 
and structure searching procedure (http://www.msi.com/ ), and (3) SeqFold™ which is a fold 
recognition method described by Fischer and Eisenberg (J. Mol. Biol. 209, 779-791 (1998)). 
This analysis was carried out, in part, by comparing the polypeptides of the invention with 
the known NMR (nuclear magnetic resonance) and x-ray crystal three-dimensional structures 

20 as templates. Table 5 shows: "PDB ID", the Protein DataBase (PDB) identifier given to 
template structure; "Chain ED", identifier of the subcomponent of the PDB template 
structure; "Compound Information", information of the PDB template structure and/or its 
subcomponents; "PDB Function Annotation" gives function of the PDB template as 
annotated by the PDB files ( http:/www.rcsb.org/PDB/ ); start and end amino acid position of 

25 the protein sequence aligned; PSI-BLAST score, the verify score, the SeqFold score, and the 
Potential(s) of Mean Force (PMF). The verify score is produced by GeneAtlas™ software 
(MSI), is based on Dr. Eisenberg's Profile-3D threading program developed in Dr. David 
Eisenberg's laboratory (US patent no. 5,436,850 and Luthy, Bowie, and Eisenberg, Nature, 
356:83-85 (1992)) and a publication by R. Sanchez and A. Sali, Proc. Natl. Acad. Sci. USA, 

30 95:13597-12502. The verify score produced by GeneAtlas normalizes the verify score for 
proteins with different lengths so that a unified cutoff can be used to select good models as 
follows: 
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Verify score (normalized) = (raw score - 1/2 high score)/(l/2 high score) 

The PFM score, produced by GeneAtlas™ software (MSI), is a composite scoring 
function that depends in part on the compactness of the model, sequence identity in the 
5 alignment used to build the model, pairwise and surface mean force potentials (MFP). As 
given in Table 5, a verify score between 0 to 1.0, with 1 being the best, represents a good 
model. Similarly, a PMF score between 0 to 1 .0, with 1 being the best, represents a good 
model. A SeqFold™ score of more than 50 is considered significant. A good model may 
also be determined by one of skill in the art based all the information in Table 5 taken in 
10 totality. 

Table 7 correlates nucleotide sequences of the invention to a specific chromosomal 
location when assignable. 

Table 6 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1- 
971, their corresponding polypeptide sequences SEQ ID NO: 972-1942, their corresponding 
15 priority contig nucleotide sequences SEQ ID NO: 1943-2709, their corresponding priority 
contig polypeptide sequences SEQ ID NO: 2710-3476, and the US serial number of the 
priority application (all of which are herein incorporated in their entirety), in which the 
contig sequence was filed. 

Table 8 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1- 
20 971, the novel polypeptide sequences SEQ ID NO: 972-1942, and the corresponding SEQ ID 
NO in which the sequence was filed in priority US application bearing serial number 
60/324,631, filed September 24, 2001. 
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Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 


adrenal gland 

• 


Clontech 


ADR002 


6-9 21 24 31 37 40 44 51 60 62-63 65-66 70-71 
75 78 80 83 86 90 108 113 117 135-136 148 
150 154-155 158-160 162 167 173-174 182 192 
197 201-202 209 213 219-220 222 229 244 248 
252 254-256 260 263 270 273 289 307 332 334 
337 345-346 371 373 392 402 407-408 423 467 
486 491 501 503-504 51 1 516 541 543 548 563- 

r f A r /* C> C~r/Z CT~1 COI tZ C\ A /"A/T /TOO /TT7 /'/I O (Z A C 

564 568 576-577 581 604 oUo 622 62/ 643-045 
673 ooo /U/ /ll /lo /o4 /4U-/4Z / d 1 toy 112. 
199 815 824 831 846 852 876-877 879 882-885 
888 904 918 940 944-945 952 954 962 971 


adult bladder 


Invitrogen 


BLD001 


21 28 47 51 58 60 66 70 104 144 166 180 187 
z J. o zzj Zjo zoo z /u-z / i z/o zou zyz .MU ^zz 
335-336 352-353 440 466 473 484 542 548 
597-598 604 646 685 750 800 953 971 


adult brain 


BioChain 


ABR012 


110 428 740-742 928 


adult brain 


BioChain 


ABR013 


298 


adult brain 


Clontech 


ABR001 


20 59 61 84 89 94 128 138 142 153 164 180 
201 279 281 287 290 297-298 301 307 313 324 
380 406 414 432 451 495 571 575 705 757 798 
800 831-832 861 863 876 883 944-945 


adult brain 


Clontech 


ABR006 


7-9 16-17 20 29 34-36 49-50 52 80 83-84 88 94 
96 109 118 120 124 126-127 133 135-136 150 
155 197 202 210 218 230-231 233 236 238 247 
252 261 270 272 274 278 281-284 286 291-292 
296 300 305 320 335-336 347 352 358 361 363 
370-371 381 388-390 393-394 396 401 403 417 
422 435 451 459 471 482 489-490 498 507 510 
516 532-533 552 565 568 582 590 592-593 
600-601 607 609 615 622 633-634 655 687 691 
708 726 732 736 750 753-754 757-758 766-769 
781 784 789-791 798 815 850 852 858-865 875 
879-880 886-887 916 921 939 948 952 957 970 


adult brain 


Clontech 


ABR008 


7-10 12-14 16 21 23-24 29 33-35 38 41 44 48 
51-52 54 61-62 65-66 68-71 78-80 83 85 89 96 
100 105-110 113 115-116 118 122-123 126 
131-133 135-136 138 143 146-149 151-155 158 
161 164 172-174 180 182-183 186-189 192 
196-197 201-202 205-206 213 215-216 220-221 
223 228-229 231 236-238 246 248 251-252 
255-256 258 260-261 264-265 269-271 274 
276-277 284 287-289 291 296-298 301 303 306 
309 311 313 319 326 328-329 331 333 337 348 
352 357 360 363 365 367 370-372 378 381-382 
384 386-390 394 397 399 401 406-407 410-412 
414-416 420-422 426 441 443-444 449-452 455 
459 466 468 475 479 489 491-494 496-498 
505-515 528 533 546 548 550-557 563-568 582 
592-594 603-607 610 612 615 622 627 633 636 
641-645 648-649 657-665 677 685 689 695-696 
705-706 71 1 718-723 726 729 732 734 736 
740-742 744 757 768 770 774 776 784-787 789 
798 800 808 812 818-819 827 829 850 856 862- 
865 868-871 873 875-877 879 882-883 885-887 
890-891 895 906 909-910 915 927-930 933-934 
939-942 944-945 947-948 955 957-958 970 
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Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 


adult brain 


Clontech 


ABR011 


894 939 


adult brain 


GIBCO 


AB3001 


5 21 37 52 61-62 70 88 96 110 112 122 127 133 
135-136 138 142 145 150 154 164 204 218 281 
305 312-317 353 466 502 581 648-649 685 691 
726 757 831 852 864-865 886 916 918 970 


adult brain 


GIBCO 


ABD003 


6-9 21 24 40 43-44 48 50-51 58-59 63 70 78 83- 
84 110 114-115 117 120 133 135-136 138 142 
144 146 148 150 154-155 159-160 164 166 172 
180 209 225 243-246 248 254 260 270 279 284 
305 309 320-322 324-327 330 353-354 359 362 

OTI n o A o f TAO A A1 A f\^i A f\ /T A f\0 AA^\ /ICO 

371 384-386 393 401-402 406 408 442 452 
466-467 488-490 502 541 551 592-593 615 636 

CAA £LT1 £.HH /^OC /ZQfZ HCYl TOO OAA 

644 o/3 o/i 05J-000 /U/ /zo /D/ /o3 oW oZo 
831 838 840 852 864-866 876-877 880 885 911 
952-953 971 


adult brain 


Invitrogen 


ABR014 


38 70 149 197 210 296 352 444 604 615 640 
705 833-835 864-865 882 915 


adult brain 


Invitrogen 


ABR015 


37 117 127 138 197 255-256 278 304 313 353 
467 476-477 604 768 885 


adult brain 


Invitrogen 


ABR016 


1 14 70 1 17 134 138 142 279 309 353 377 466 
911 


adult brain 


Invitrogen 


ABT004 


13 24 28 36 38 41 44 5 1 65 75 83 89 1 14-1 15 
117 127 138 142 148 152 166 173-174 182 203 
211-212 223 229 243 255-256 260 265 270-271 
277-278 289-290 295 309 319 322 328 331 346 
352 358 362 370-371 377 380-381 386 401 416 
420 437 453 455-457 491 548 563-564 568 575 
581 588 606 768 783 798 833-835 838 880 899 
903 970 


adult cervix 


BioChain 


CVX001 


5-7 16-17 19 21 23 25 31 36 44 48-51 59 62-63 
65 70 75-76 78 80 85 89-90 96 99-100 108-1 10 
112 115 117-118 120 122 124 126 133 135-136 
138 142 147 150 153-155 164-166 172 174 182 
185-186 188 190 192 194 196 200-201 209 
215-216 218 225 229 235 239-241 246 255-256 
260-261 267-268 270 274 276 278 287 291 
296-297 311 315 329-330 332 335 342 351-353 
364-365 368 371-372 374 378 394 404-405 414 
436 438-439 446 459 463 466 470 472 474 482 
489 502-504 526 533 541 551 567 569 577 582 
585-586 588 595-596 612 614 631 633 636 
641-643 645 648-649 666-667 681 689 691 697 
701 705 718 726 734 740-743 748-750 754 757 
768 783 785-786 800 815 819 826-827 829 831 
838 852 857 864-877 879-880 882 886 890 
906-907 916-917 922-923 929-930 933-936 948 
956 961 971 


adult colon 


Invitrogen 




i c OQ 'Xl 1Q A1 Q1 Q£ 1 1 ^ 190 "\1\A 1 1fv 1 0 1 
10 Zo d i-do 4.5 do y 1 yo i ij IZZ 1.34 1 /O Ly i 

209 260 270 336 341 344 401 427 450-451 467 
470 476-477 563-564 612 618 663 670 705 750 
755 847 883 907 937 951 971 


adult heart 


GIBCO 


AHR001 


6 8-9 13-14 20-22 28 30 36 40-41 45-47 49-50 
52 58-59 62-63 65-66 70-71 75 78 80 84-85 93 
96 101 109-111 113 118 126 141 148 150-151 
153 155 165 182 185-188 202 210 214-221 
224-226 235-236 239-240 243-244 246 252 254 
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Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 








258-259 263-265 271 274 282-284 297 311 319 
327 329 338 340 343-344 348 350 353 360 365 
387 396 402-404 408 421 426 430 452 455 464- 
467 483 490 499 502-504 536 538 541 551 569 
577 602 615 624-625 644 662 673 677 691 705- 
707 715-716 726 734 740-742 751 757 761 784 
787 800 819 822 838 852 864-866 876-877 881 
883 885-886 890-891 895 903-904 906 917 
919-920 929-930 937 948 966 971 


adult kidney 


GIBCO 


AKD001 


6-7 11-12 16-17 19 23-24 28 30 36-37 45-46 48 
51 55 58-60 62 66 70-75 78-80 85 88 96 101- 
102 104 108-109 117-118 120 122 125-126 133 
135-136 138 141 144 150 154-155 159-160 162 
164 166 168-169 172 174 177 185 187-188 191 
197 200-201 208-212 218 222-229 231-232 
239-241 244 246 254 258 266-272 278 280 289 
292-293 295 305 311 313 331-333 343-344 353 
357 364-365 368 371 378 406 409-412 423 435 
442-443 457 461 466-468 485-487 519 526 533 
548 556-557 563-564 567 569 577 582 588 612 
615 629 636-637 643-645 648-649 659 662-663 
685-686 697 707 726 740-743 750 754 763 772 
800 812 824-826 829 838 852 861 864-866 876 
880 882-883 885-886 903 906-907 919-921 
929-930 944-945 948 953 957 962 970-971 


adult kidney 


Invitrogen 


AKT002 


6-7 15-16 19 21 23 25 44 48 51 58-60 63 66 70 
72-74 80 89 92-93 96 99-100 109 114 117 120 
123 135-136 141-142 148 153 155 159-160 162 
173-174 182 185 187 194 201 203 226 228 231 
235 239-240 244 263 270 273 279 287 293-294 
301 309 322 326 334-336 355 368 371-373 378 
390 409 414 417 423 465-467 476-477 491-493 
502 526 544 548 563-564 568 582 606 612 615 
636 644 648-649 659 662 672-674 681 691 731 
740-742 744-745 765 768 774 782 784-786 800 
824 827 829 840 863-865 867 873 882 884-886 
891 903-904 906 917-918 929-930 937 944-945 
948 966 970-971 


adult liver 


Clontech 


ALV003 


2-4 11 40 59 72-74 91 148 184 277 293 312 467 
548 768 840 882 886 919 944-945 


adult liver 

• 


Invitrogen 


ALV002 


6-7 10-11 13 19 25 28 36 38 45-46 49 63 65 71- 
75 77 88 96 114 134 153 155 170 180 184 187 
200 202 219 232 236 248 255-256 269 272 277- 
278 297 309 332 353 356 362 407 409-412 445 
453 466 468 487 494 496 517 533 561 581 612 
643 645 701 750 760 826 829 840 881 883 903 
916 918 929-930 937 948 


adult lung 


GIBCO 


t -w* /"V r\ 1 

ALGOOl 


6 13 21 25 31 70 75 83 99 1 1 / 122 133 loo 186 
208 224 228 233 241 273 287 289 295 305 332 
337-338 353 365 414 421 441 466-467 479 491 
548 551 563-564 636 648-649 705 740-742 757 
868-871 899 921 


adult ovary 


Invitrogen 


AOVOOl 


5-7 11 13 16-17 19-23 25 28 36-38 40-43 45-46 
50-51 54 59-60 63 65-66 70-78 82-83 88-90 93 
96 99 102 105 108-110 117-118 120 122-124 
131 133-136 142 144 147-150 154-155 159-160 
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Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 








164 166 169-170 172-174 176-178 180 182-185 
188 191 196-197 200-201 204 209-213 215-216 
218 222-223 225 228-229 231 235-236 239-241 
243-244 246 254-258 260 263-264 269-270 
277-280 287 291-292 295-297 300 304-305 307 
312-313 315-317 323 327 329-332 340-347 
352-353 357-358 365 368 371 376 378 388-397 
401 404-407 410-412 414-415 417 421 423 437 
440-442 444 451 459 461 466 468-475 479 
483-484 490 492-493 502-504 510-511 519 
533-534 542 546 548 551 559 563-564 568 577 
588 592-593 597 602 612 615 622 626 631 635- 
636 643 645 648-649 659 670-671 673 681 683 
685 689 691 698 707 71 1 715-716 724 731 740- 
744 748 754 757-758 768 771-772 783 787 798 
800 812 815-816 826 831-832 838 840 844 852 
864-866 868-871 873 875-876 879-880 883 
885-887 903-904 906-908 911 916-918 921-923 
928-930 933-935 944-945 948 953 955 957-958 
961 965-966 971 


adult placenta 


Clontech 


APL001 


24 49 78 95 201 237 349 353 490 637 747 953 


adult spleen 


Clontech 


SPLcOl 


34_35 44 48-49 52 83 99 108 110 118 122 124 
126 150 153 155 161 186 194 197 201 206 215- 
216 223 232-233 241 248 270 280 287-289 
296-297 306 331-332 355 363 373 396 414-416 
444 446 451 463 466 468 509-510 533 552 556- 
557 561 565-566 576 601 606 634 636 656 710 
720 736-739 773 798 827 864-865 875 879 883 
899 916 919-920 940 952 955 970 


adult spleen 


GIBCO 


ASP001 


8-9 12 49 68-70 79 83 88 101 105 112 114 118 
122 133 148 174 218 224 231 260 287 295 316- 
317 337 341 344 352-353 398 421 423 441-442 
466 479 491 518 551 563-564 604 705 754 772 
829 910 955 970-971 


adult testis 


GIBCO 


ATS001 


20-21 30 51 54 58 70 72-74 83 96 101-102 107- 
108 117 141 150 153 155 187 203 211-212 220 
222-223 244 252 273 279 287 289 296 298 330 
338 344 353 365 417 443 467 484 489 503-504 
577 582 662 686 707 757 907-908 955 970-971 


bone marrow 


Clontech 


BMD001 


1-4 12 14 21 24 27 37 43-47 58 70 79-80 85 87- 
88 90 95 99 110 115-126 131 141 155 157-167 
169 172 185 192-193 195-197 200 205 215-216 
225 227 231 233 237 244 262 273 277 284 295 
298 301 313 329 331 344 353 357 365 371 378 
401 405-408 410-415 423 437 442 446 466-467 
474 479 490 503-504 541 548 563-564 567-568 
583 585-586 588 605 622 635-636 638 645 655 

/C/CO &.H'\ t^TTX £Q1 £0*7 HC\& 71 C HACi 1A1 I^A 

ooz o/i o/d ood oy / /uo /lo /zo / z *u-/hd /D^t 
757 783 825-826 831-832 850 852 864-866 906 
908 910-911 948 953 955 957 971 


bone marrow 


Clontech 


BMD007 


1 70 856 


bone marrow 


GF 


BMD002 


1-7 14 16 20 24 26 29 38 41 43-44 48 52 55 60 
65 70-71 75 78 80 82 85 88 90 96 107 112 114 
117 119 122 124-126 133 140 148-149 154-155 
158 161 164 166-167 172 174 186 188 193 196 
200 205 211-212 215-216 220-221 225 230 233 
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237 239-240 242 248 251-254 257 261 264-265 
273-274 284 287 290 294-298 305 307 310-311 
316-317 331 337 341 344 348 353 355 357 365 
370 373 376 378 387 389 398 401 408-409 414- 
416 421 423-424 427 441-442 446 455 463 
466-468 479 488 491 496 501 509 516 521 531 
533 539 541 543 546-548 550-551 558 563-564 
569 571 576 583 588 595 601 606 615 622 627 
651 673 706-707 709 71 1 720 724-725 727-728 
738-739 754 757 761 773 776 778-779 783 787 
791-793 800 815 820 826 829 831 846-847 850 
858 861-865 872 879 886 895 899 906-907 
909-910 933-935 941 944-945 952 959 


bone marrow 


NULL 


STM001 


41 70 82-83 248 261 332 430 840 882 


cultured 
preadipocytes 

* 


Stratagene 


ADP001 


8-9 21 23 41 48-49 58 66 70-71 79 82 92 105 
138 148 166 169 186 222-223 248 280 297 313 
329 348 353 407 423 442 452 466 492-493 551 
577 606 637-638 659 705 714 750 752 838 853 
864-865 880 882 886 907 929-930 


diapliragm 


Bio Chain 


DIA002 


387 466-467 


endothelial 
cells 


Stratagene 


EDT001 


5-6 8-9 12 17 21 23 30 37-38 40-41 45-47 49- 
51 58-60 63 70-71 77-80 83 85-86 88 93 96 101 
108-109 114-115 117-118 122-123 133 139 148 
150 153 159-160 165 169 178 182 185 187 207- 
208 213 223 225 228 233 239-240 243-244 248 
255-256 261 270 273 279-281 287 292 297 
299-300 309-310 312-313 332 336 353 355 360 
363 365 378 392 396-398 400 404 406-407 
410-412 415 421 432-433 435 437 439 441-442 
444 452 466-467 492-493 502-504 544 548 555 
569 581 585-586 592-593 604 612 615 635-638 
645 662 685 689 691 706-707 715-716 731 734 
747-748 754 777 783-784 800 815 819 832 852 
856 879-880 883 886 888 890 893 896 904 906 
913 922-923 929-930 953 966 971 


esophagus 


BioChain 


ESO002 


761 


fetal brain 


Clontech 


FBR001 


21 84 146 177 191 198 294 296 312 355 362 
480 510 630 659 740-742 764 769 781 784 886 
906 953 


fetal brain. 


Clontech 


FBR004 


23 49 60 135-136 148 268 272 278 281 298 384 
390 426 462 482 634 655 887 899 


fetal brain 


Clontech 


FBR006 


5-9 13-14 16 19 21 33-35 38 41-44 47-48 50-52 
54 60-63 65 70 75-78 82-83 85 89 100 107 110 
114 116-117 123 130-132 135-136 138 144 
146-147 152-153 159-160 164 169 179-180 
187-188 190 197-198 201 213 215-216 225 228 
231 233 238 242 248 251-253 255-256 258 
260-261 264 266-271 273 276 284 286-287 289 
297 301-302 309 311 313 318 321 329 337 341 
343 346 348 353 365 370-371 373 375 378 383- 
384 390 397 399 415-416 422 426-427 434 437 
440 444 446 452 462-463 466-467 469 471 475 
482 489 491 496-497 502-504 506 509 51 1 521 
528 533 537-538 546 548-549 563-564 566 574 
581 585-586 588 590 592-593 599-604 606-607 
612 627-628 633 636 645 651 659 662 668 673 
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677 689 705-706 709 711 715-716 729 732 
740-742 744 750 754 758 762 768 772 777 779 
783 789 795-800 812 818-819 826 829 832 843 
856 867 872-873 875 877 879 882 887 890 895 
903-904 909-910 919-920 922-923 927-928 
940-942 957 970-971 


fetal brain 


Clontech 


FBRs03 


138 200 822 944-945 


fetal brain 


GIBCO 


HFB001 


5 7 16-17 21 23 38 40 43 47 49 58-59 68-71 75- 
76 78-79 83-85 89 96 102 106 108-110 112 
117-118 120 126-131 133 135-152 154-156 
159-161 166 169 185 196 213 219 231 243 245- 
246 254-256 260 263 270 273-274 279 303-305 
313 324 341 352-353 359-366 371 381 397 
406-407 415 423 454 461 467 483 490 537 548 
551 569 574 585-586 604 606 636 639 645 648- 
649 662 670 683 685 691 706 708 740-742 746 
751-752 754 758 761 768 774 798 831-832 861 
864-866 879 883 887 902 906 908 911 918 944- 
945 948 953 958 970-971 


fetal brain 


Invitrogen 


FBT002 


13 21 41 60 65 70 79 101 114 130 152 155 169 
178 180 185 187 197 229 243 245 270 273 277- 
279 281 287 290 293 297 306 309 322 324 335 
342 352 358 395 423 458-462 466 514 548-549 
551 575 588 606 612 645 685 706 714 746 754 
829 880 886-887 


fetal heart ■ 


Invitrogen 


FHR001 


5-6 17 24 28 47-49 52 54 60 65-66 71 75-78 80 
83 85 89 96 98 107 120 126 135-136 140 146- 
148 153 158 163-166 168 170 173 186 209 213- 
214 218 221 229 231 233 235-236 243 248 251 
255-256 259-260 265 267-270 276-277 284 288 
296 306-307 315-317 329 33 1 334 337 344 353 
371 373 376 378 388 396 408 415 421 425-426 
441 444 450 452 464 466 468 475 479 497 499 
505 508 528 533 536 541-543 551 568 571 577 
585-586 604-606 641-643 662 664 683 688 732 
734 740-742 754 772 801-806 815 821-823 
831-832 855 863-865 873 875-876 881 883-887 
891 898 904 906 916 928-930 942 948 953 957 
971 


fetal kidney 


Clontech 


FKD001 


59 65 70 83 118 164 169 194 213 225 235 260 
278 282-283 300 331 336 353 416 457 479 542 
714 740-742 754 893 917-918 953 971 


fetal kidney 


Clontech 


FKD002 


8-9 11 21 23 48 63 65 77-78 83 100 105 107 
116 121 131 133 135-136 140 147 155 164 172 
188 213 218-220 243 252 254 257 270 276 284 
287 292 297 326 338 348 353 366 376 388 406 
408-409 418 446 466-468 489 496 505 528 541 

<AA ^9 ^71 601 61? 699 6*34 647 6S6 601 

705 746-748 758 766-767 772 774 781 794 807 
827 831 833-835 837 854 858 861 863 882 891 
906 940 943 957 


fetal kidney 


Invitrogen 


FKD007 


70 563-564 


fetal liver 


Clontech 


FLV002 


1 18 40-41 47 49-50 77 103 113 116 131 202 
223 255-256 445 583 673 760 789 808 


fetal liver 


Clontech 


FLV004 


1-5 8-10 14 52 58 63 67 70 75 79 85 95 100 116 
118 122 124 135-136 148 158 166 171 174 188 



WO 03/029271 



PCT/US02/30474 



121 
Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 








198 200 209 213 225 233 243 280 284 296 302 
307 330 344 353 356 366 371 373 376 389 423 

a r- AAA A A C A C r\ A f S~ A tHO A OCX A C\f\ AC\tC C f\"> C f\A 

435 444-445 452 466-468 489-490 496 503-504 
530 533 541 548 561 581 583 585-586 590 606- 
607 627 633 636 648-649 655 662 690 695-696 
738-739 760 774 805 807 809-811 815 824 831 
838 857-858 862 879 886-887 899 910 917 953 


fetal liver 


Invitrogen 


FLV001 


1 1 1-12 24 38 41 43-44 70 72-74 76 88 107 1 10 
1 12 170 184 203 229 262 278 297 362 365 376 
407 432 442-443 445-448 467 496 548 577 581 
800 883 899 


fetal liver- 
spleen 


Columbia 
University 


FLS001 


1-31 33-38 40-54 58-60 63-80 83-108 110-114 
117 120 122 124 135-136 141 147-148 153 155 
159-160 162 166 168-182 184-185 192 196 
199-202 213 215-217 225-226 229 236-237 244 
253 269 278-280 282-283 288 298 302 304 306 
313 315 322-323 325 329-332 341 344 348 
353-357 363 365 376 378 397 399 406-408 
414-415 417 421 423 426 441-442 445 451-452 
461 466-468 476-477 486 489-491 502-504 530 
533-534 541-544 548 551 556-557 563-564 
567-569 581 585-586 588-589 604 607 612-614 
622 636-638 641-642 644-645 662-663 670 673 
682 684-686 689 691 695-696 704-709 715-716 
734 738-742 744 750 754 758 760-764 769-772 
776 783-786 800 812 815 826-827 829 831 838 
840 848 852 861 864-866 874-875 879-880 
882-883 885-886 896 903-904 906 911 913 916 
922-923 929-930 937 944-946 948-949 953-954 
966 970-971 


fetal liver- 
spleen 


Columbia 
University 


FLS002 


2-6 8-11 13-14 16-17 20-21 24-29 31 36-37 40 
43-44 47 50 52 54 59-60 63-65 67-80 86-90 94- 
98 101-103 106-110 112-113 120 122-124 133 
135-136 138-139 142 148 153 155 158-160 162 
166 169-170 172 174 176 179-180 182 184 188 
191-192 196-197 199 205 211-213 215-216 
218-219 222 225-226 231 233 251 254 260 262 
267 269-270 272 280 282-283 287 290 293 297 
299 301 303-304 306-307 313 316-317 319 
321-322 328-329 331-332 341-342 344 353 
355-357 363 365-366 378 382 386 388-389 391 
406-407 410-412 414-415 421 423 431-432 441 
445 447-448 451-453 457 461 467 474-477 484 
487 490 502 506 518 521 530 542-544 546 548 
551 568-569 574 580 583-593 602 604-605 
612-613 622 629 635-636 638 641-644 652 662 
670-671 682-683 685 688-693 706-716 734 
740-744 747-748 760 /oz 111 1 14 /oj-/oo oUo 
815-816 824 826-829 831-832 838 852 866 875 
878 885 903-904 906-907 913 916 921 925-926 
929-931 937 941 943-946 949 952 970 


fetal liver- 
spleen 


Columbia 
University 


FLS003 


1-5 10 16-17 30 33 60 70 72-74 76 79 88 96-97 
99 108 113 117 122 124 126 153-155 169-170 
172 179 184 186 191-193 199 223 229 239-240 
260 290 297 302 313 332 344-345 350 353 363 
376 387 402 406 445 447-448 476-477 486 516 
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519 541-543 548 555 561 605 613-614 637 652 
740-742 744 754 761-762 765 776-777 783-784 
791 799 819 829-830 842 844 866 879 882 886 
890 916 944-945 


fetal lung 


Clontech 


FLG001 


7 68-70 92 108 166 170 173 180 261 268 273 
353 355 365 498-499 519 868-871 883 886 890 
970 


fetal lung 


Clontech 


FLG004 


60 


fetal lung 


Invitrogen 


FLG003 


1 8-9 36-37 41 43 70 79 107 113 134 148 166 
177 188 202 205 223 243 280 310 316-317 332 
376 407 437 450-451 454 486 522 552 558 605- 
607 631 685 731 761 772 781-782 863 868-871 
899 921 941 951 965 


fetal muscle 


Invitrogen 


FMS001 


24 36 41 70 79 101 113 117 126 148 167 177 
198 201 223 226 265 281 344 350 387 398 409 
428 466 508 559 563-564 575 588 608-610 613 
636 670 707 738-739 828 880 884 903 917 921 
953 971 


fetal muscle 


Invitrogen 


FMS002 


5 13 18 21 23 36 41 44 47 52 63-64 70 87 98 
100 105 110 126 135-136 147 153-154 166 
185-186 213-214 218 220-221 231 236 259 265 
271 280 284 286-287 306 316-317 329-330 338 
341 361 371 373 388-389 394 396 408 421 423- 
424 426 428 441 464 466-467 473 498 508 521 
533 536 541 543 551 558 567 569 583 600 606 
613 622 677 744 748 754 791 794 805 818 831 
855 862 864-865 875 877 881-882 885 927 939 
944,945 960 


fetal skin 


Invitrogen 


FSK001 


1 7 24-25 29 32 36 38 41 63 68-71 75 79 83 85 
88 104-105 108 116-118 122 127 138 142 146 
148-150 154 158 169-170 173 178 182 185 187 
190 197-198 200 204 209 21 1-212 223 233-235 
243 245 251-252 254 261 264 268 270 277 280 
290 297-298 305 316-317 326 330-332 334 336 
341-342 350-351 353 357 376 379 395 401 
406-407 409-412 414 423-424 440 443-444 
450-451 455 457 466 472 489 492-493 507 516 
522 551-552 563-564 572 577 583 587 606-608 
611-614 619 622 627 630 634 643 652 660 671 
675-679 685 691 705-707 731 733 738-742 744 
753 768 772 812 826 830 840 843 848 860 864- 
865 868-871 883-885 899 903 906-907 910 
916-917 928 948 953-954 956 965 


fetal skin ' 


Invitrogen 


FSK002 


1 5-6 23 36 38 42 44 48 54 65 68-69 78 83 85 
94-95 98 109-111 114 116 122 135-136 138 
147 149 155 164 166 173-174 178 182 186-187 
198 201 206 210 213 220 223 225 229 231-232 
237 245 249 255-256 262 268 270 284 288 292 
296-298 302 306 321 337 346-347 350-351 365 
367 370 388 396 403 406 409 423-424 433 439 
450-452 472 479 486 489 492-494 506 528 
533-534 541 543 546 548 552-553 556-558 566 
571 577 583 608-609 622 627 643-644 658 675 
705-706 720 738-742 748 754 763 773 776 791 
812 822 826 830-831 837-843 854-855 861 
864-865 875 879 882-883 885 891 895 899 904 
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906-907 910 916 928 937 942 944-945 948 958 


fetal spleen 


BioChain 


FSP001 


6 24 352 


fibroblast. 


Stratagene 


LFB001 


5-6 31 36-37 45-46 49 59 65 70 96-97 109 134 
144 146 150 155 186 192 196 226 254 257 273- 
274 279-280 327 353 371 397 407 410-413 415 
417 466-467 541 612 629 643 673 685-686 688 
691 697 704 776 815 817 826 829 864-865 879 
911 919 942 948 953 965 


induced 
neuron-cells 


Stratagene 


NTD001 


15 19 23 48 59-60 70 89 96 105 113 120 128 
133 137 146 148 163 197 213 236 245 295 315 
331 353-355 371 395 451 466 500 541 551 644 
674 681 685 695-696 698 715-716 763 807 854 
866 940 944-945 948 958-959 966 


infant brain 

• 


Columbia 
University 


IB2002 


5 7-9 13 21 24 28 36 38 40 47 55 58-61 63 65- 
66 68-69 71 75 80 84-85 88 93 107 109-110 
115 122 126 128 130-131 135-138 142 146 153 
155 161 166 172 180 182 184 187 197 200 203- 
204 211-213 218 220 222-223 225 228 245 
260-261 263 270 274 278 280 282-283 290 296 
303-307 309 316-317 320 322 331-332 334-335 
358 362 365 367-370 378 382 384 389 401-402 
406-407 416-417 423 437 442-443 450 455 459 
462 471 479-483 489-490 519 548 554 563-564 
569 575 578 583 592-593 599 612 625 639-640 
659-660 685 691 706 731 740-742 750 754 768 
784 791 800 819 825 829-830 862 866 879 882- 
886 903 908 911 916 918 925-926 942 944-945 
962 966 970-971 


infant brain 


Columbia 
University 


IB2003 


8-9 21 30 34-36 38 41 47 51 58-60 70 75 78-79 
85-86 92-93 96 107 110 115-116 118 127-128 
135-136 139 142 146 149-150 153 155 159-160 
164 166 174 176 180 182 187 210 213 218 220 
222 228 245 250 252 260 269-270 273-274 277 
280-281 296 304 312-313 316-317 322 326-327 
332 334 341 344 353 358 361-362 365 367-370 
401 407 415-416 421 423 441-442 451 458 471 
479 482-483 488-489 503-504 507-508 510 
563-564 569 571 583 606 670 685 702-703 736 
750 768 776 785-786 788-789 798 812 819 831 
840 851-853 860 863 866 873 879 883-885 887 
890 903 909 911 916 918 938 940-941 944-945 
962 966 970 


infant brain 


Columbia 
University 


IBM002 


5 8-9 28 84 135-136 210 309 316-317 401 442 
546 577 685 734 838 860 885 


infant brain 


Columbia 
University 


IBS001 


36 38 59-60 65 86 108 1 10 137 139 180 245 
314 320 353 450 456 563-564 798 825 831 970 


leukocyte 


Clontech 


LUC003 


12 21 37 50 59 70-71 79 99 148 155 161 174 
222 234 236 284 365 455 4oo 516 556-557 56J- 
564 588 637 726 740-742 887 905 907 910 941 
971 


leukocyte 


GIBCO 


LUC001 


1 5 12-13 17 21 23-24 33 36-37 41 43-46 48-49 
52 59 70-71 76 78-79 85 87-88 96 99-100 108 
110 116 118-119 122 124 127 133-134 141 
146-148 155 159-161 163-164 166-167 169 
172-174 187-191 194 197-198 202-207 213 
224-226 228-229 231 234 236 238-249 251-258 
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260 263 271 280 284-289 295 298 304 309-311 
313 315-317 335 341 353 357 359 365 371 387- 

iqq a ao Af\£. zinc Af\Q A17 AO'X AA0-AA% 44R 
iOO 4UZ 4U0 4U5-4Uy 41 / HAD HHArHHD hho 

466-468 479 489-490 492-493 496 501 503-504 
532 541 546 548 551 556-557 563-564 568-569 
571 584 605-606 612 615 622 635-637 641-649 
653 655 662 691 693 697 706-707 715-716 718 
732 734 740-742 747-748 769 772 783-784 800 
812 826 831 840 866 876 882-883 896 899 903- 
904 906 909-910 921 925-926 933-936 938 940 
947-948 953 955 971 


lung tumor 


Invitrogen 


LGT002 


6-7 12-13 20-21 23 25 29 36-37 41-42 45-48 51 
55 63 65 71 78-79 85 96 99 110 113-114 117 
122 139 148-149 153-155 159-160 162 166 
169-170 174 182 184-185 187 192-194 200 202 
204 209 219 224-227 232 235 237 252 255-256 
268 270 277-278 290 295 297 299-300 304-305 
308-310 316-317 322 329-331 335 341 344 346 
353 365 371-372 375 380 386 392 394 396 400- 

a r\n ac\£L A{\h /i 1 n /MO All All A1Q AAA A^O 
402 4UO-4U/ 41U-41Z 4Z3 hd l hd? 444 4Z)Z 

466-468 470 475-478 487-489 503-504 542 544 
546 548 551-552 556-557 563-564 579 581 588 

riO fzic. (ZAI &A*\ 6zlR 6AQ 669 66R-671 67^ 
OIZ OjI OOO 04J-04D OHo-OHy OOZ DDO-U/ 1 \J / D 

670 681 6R^ 6RS 601 706 71 S-71 6 740-743 
751-752 754 768 772 783 798 812 819 826 828 
831 840 843 852 854 857 863 877 879 882-883 
886 890-891 895 906-907 925-926 933-935 938 
948-951 953 957 970-971 


lymph node 


Clontech 


ALN001 


25 44 60 70 76 96 125 138 187 315 340 353 
397 446 467 489 707 740-742 879 931 944-945 
971 


lymphocyte 


CA-46 


DGD001 


37 83 159-160 440 650 787 900 


lymphocytes 


ATCC 


LPC001 


1 5 12-13 23 25 31 33-37 42-44 48 54 58-59 71 
76 78 85 89-90 99 116-117 119 122 124 147- 
148 153 161 165-166 169 174 181-182 187 189 

i a i tnA 1 c\c ia/c oao Tin n/c ooo ooo ia 1 O/tl 
191 194-19 J zUo-ZU / Z1U ZZO-ZZ / ad / ahi aHd 

0<M O^O 06fk 061 06C 9*79 97*1 9R0 9RA 9R7 9RR 
AD L-ADa ZOU-ZOl ZOo Z/Z AID Ao\J ZO*t ZO/-Z00 

301 306 311 316-317 322 327 338 348 355 357- 
358 365 371 377 381 389 407 422-423 433 450- 

A^1 A^ AfH A%1 4R6 400 S01 S0VS04 S41-S43 

HD 1 HjJ HO I HOD HOO Hy\J D\J L D\JD"D\J'-r Jtl — J*T J 

546 548 556-557 568-569 607 643-644 662-663 
670 688 691 694-697 706 718 734 740-743 789 
831 840 846-847 850 861-862 864-865 883 887 
903-904 908 910 933-934 944-945 955 970 


lymphocytes 


RPMI-8226 


DGD004 


159-160 


macrophage 


Invitrogen 


HMP001 


21 44 63 112 119 153 190 211-212 222 225 248 
255-256 365 388 401 406 450 459 463 630 644 
705 826 882-883 886 956 


mammary 
gland 


Invitrogen 


MMG001 


5 7 12-13 19-21 25 29 36-38 41-42 44 47 51-52 
59 65 68-71 76-77 79 86 89-90 101 104 107 
110 112 116-117 122 127 131 135-136 138 142 
148-149 153 166 169-170 172-174 177 180 182 
184 191 204 21 1-212 223 225 235 243 248 252 
256 260-261 267 270-271 276-277 280 296-297 
302 305-306 309 313 316-317 331-332 335-336 
340 344 351-352 365 371 377-383 387-388 390 
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Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 








400 404 406 409-412 414 423 426 443-444 450 
452 455 457 466 470 489 491-494 499 511 522- 
524 548-549 555-557 563-564 568 575 577 581 
588 592-593 606 612 636 643 645 659 670-671 
677 685 706-707 711 718 724 730-731 734 
740-742 744 748 757-758 761 766-768 772 783 
785-786 800 812 816 824 829 838 840 848 852- 
853 857 861 868-871 880 882-883 885-886 899 
903 906 916 921 929-930 933-935 938 944-946 
951 970-971 


melanoma 
from-cell-line- 
ATCC-#CRL- 
1424 


Clontech 


MEL004 


5 17 21 31 47 71 78-79 99 112 114 117 122 124 
131 159-160 170 172 201 215-216 219 243 
316-317 353 422-423 466-467 494 533 543 567 
570 585-586 707-708 812 826 831 858 863 879 
893 895 916 919-920 952 955 959 970 


mixture of 1 6 
tissues/mRNA* 


various 
vendors 


SUP002 


1 7 24 36 38 44-46 60 66 70 89 1 19 127 169 
182 194 205 210 218 221 223 227 229-230 235 
255-256 270 273 277 281-283 286 293 296 302 
309 347 353 365 426 444 456 462 466-467 494 
553 592-593 601 630 634 651 677 685 705 718 
840 885 896-899 902 910 91o 928 931 94o 


mixture of 1 6 
tissues/mRNA 


various 
vendors 


SUP004 


495 


mixture of 1 6 
tissues/mRNA 


various 
vendors 


SUP007 


38 


mixture of 1 6 
tissues/mRNA 


various 
vendors 


SUP005 


1 148 221 365 512 516 634 776 833-835 879 
912-914 941 


mixture of 1 6 
tissues/mRNA 


various 
vendors 


SUP008 


1 5 13 38 61 117 135-136 155 161 172 243 250 
290 401 466 703 911 


mixture of 16 
tissues/mRNA 


various 
vendors 


SUP009 


1 38 70 96 1 10 243 307 768 81 1 939 953 956 
971 


neuronal cells 


Stratagene 


NTU001 


13 21 45-46 60 68-70 83 89 107 139 148 178 
197 207 218 220 248 251 284 309 409 526 541 
563-564 585-586 604 633 660 677 685 699-700 
715-716 726 752 761 763 774 776 782 844 858 
863 872 877 882 899 914 940 950 


pituitary gland 


Clontech 


PIT004 


28 31 59 68-69 79 85 89 98 101 108-109 115 
127 134 150 159-160 165 169 198 218 231 235 
260-261 273 297-298 313 322 327 330 353 364 
367 395 402 459 466 541 568-569 574 641-642 
670 691 734 757 831 883 906 911 916 919-920 


placenta 


Clontech 


PLA003 


5-6 20 47 55 59 63 71 83 85 99 108 116 118 
124 135-136 138 140 153-154 158 161 171 174 
182 225 243 270 279 284 287 297 302 334 349 
353 389 391 393 409 444 447-448 467 488 490 
496 539 581 592-593 601 622 634 656 709 720 
738-742 765 787 799-800 812-814 820 824 829 
831 839 844-845 ool-ooo o/o ooz-oo4 yil) ylo 
919-920 942 948 952 958 


placenta 


Invitrogen 


APL002 


12 24 37 68-70 82 102 110 117 174 208 280 
349-352 423 492-493 548 800 921 


prostate 


Clontech 


PRT001 


6 13 20 23 25 37 42 49-50 65 70-71 76 85 107 
1 17 150 162 166 203 215-216 219 254 270 277 
279 288 297 301 305 307-308 310 321 338 353 
366 376 379 402 414 466-467 469 525-528 567 
580 585-586 588 602 648-649 673 686 706 718 
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Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 








726 735 761 768 787 820 829 831 852 8oo 8/y 


rectum 


Invitrogen 


REC001 


6-7 18 43 66 79 96 1 13 138 148 loo 170 182 
202 207 321-322 341 356 414 443-444 450 466 
470 484 489 491-493 530 588 615-619 707 734 
766-768 793 807 848 868-871 880 883 916 921 
928 938 


retinoic acid- 

induced- 

neuronal-cells 


Stratagene 


NTR001 


5 21 23 25 52 62 70 83 96-97 120 123 148 154 
180 194 271 306 310 314 389 489 524 541 572 
606 686 690 710 726 800 862-863 882 


saliva gland 


Clontech 


SALs03 


66 


salivary gland 


Clontech 


SAL001 


5 28 30 60 63 65 70 79 107-108 117 126 130 
133 135-136 144 148 167 172 196 208 213 270 
280 287 294 326 353 365 380 4d / 4o / Dii j / j 
62U-OZ3 o3 j-d30 o4o-o4y / 1 j- / 1 o /o / 0Z0 oJo 
868-871 904 956 965 971 


skeletal muscle 


Clontech 


SKM001 


43 48 66 75 79 1 17 126 146 155 167 197 201 
214 217 219 221 225 247 254 259 265 297 301- 
302 309 329 387 394 428 497 598 638 691 724 
752 757 829 864-865 881 883 948 


skeletal muscle 


Clontech 


SKM002 


387 


skeletal muscle 


Clontech 


SKMs03 


387 


skin fibroblast 


ATCC 


SFB001 


89 387 


small intestine 


Clontech 


SIN001 


5-7 12 17-18 21 24-25 27 44-46 58-60 62-63 
70-71 77-78 87-88 96 99 101 109 111 117 122- 
124 135-136 140 148 150 161 166 172 174 176 
186 191 194-195 201 209 213 221 223 225 228- 
229 234 243 246 260 273 280 284 293 297 300- 
302 306-308 310 313 326 331-332 340-343 347 
353 357 362 365 371 378 393 404 416 432 445- 
448 459 467 474 489 492-493 521 529-531 533 
543 556-557 559 602 605 622 631 638 643 645- 
646 668 670 697 726 740-743 757 761 763 768 
772-775 780-782 784-787 812 829 831 852 
854-857 876 879 885 896 904 906-908 910 921 
931 944-945 948-949 953 958 960 970 


spinal cord 


Clontech 


SPC001 


8-9 12 44 48-49 51 65 70 75 108 113 116-117 
120 122 127 135-136 150 155 159-161 164 166 
174 182 185-186 188 219 234 260 273 276 278 
286 290 292 301 309 326 337 347-348 353 362 
365 378 405 420 429 451 469 474 479 491 499 
532-537 542-543 563-564 636 645 648-649 673 
683 686 691 708 718 720 726 740-742 758-759 
768 772 784 787 800 824 829 832 873 885-887 
911 944-945 953 957-958 962 


stomach 

* 


Clontech 


STO001 


6-7 34-35 59 65 70 72-74 172 237 255-257 293 
307 353 371 402 410-412 569 600 636 673 866 
868-871 879 906 931 955 970 


thalamus 


Clontech 


THA002 


5 13 28 59 63 77 79 1 15 1 17 122 127-128 138 
142 146 182 187 190 197 205 220 252 307 309 
328 336 344 367 386 421 435 437 489-490 497 
508 538-539 548 581 615 685 691 705 726 756- 
757 759-760 768 826 879-880 883 886-887 896 
902 921 958 971 


thymus 


Clontech 


THM001 


5-6 17 44 47 49-50 52 59 68-71 85 87-88 96 
100 104 117-118 122 157 159-161 164 169 



WO 03/029271 



PCT/US02/30474 



127 
Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 








173-174 179 186 197 209 215-216 220 225-226 
254 273 280-281 284 288 298 306 310 331 344 
348 352-353 364 406 414 432 442-443 466-467 
475 479 489 492-493 498 528 541-542 552 
556-557 559 567 571 576 580 604 606 612 628- 
630 648-649 673 693 697 734 743 757 760 768 
776 784 800 824-826 832 848 861 864-865 886 
904 907 921 941 953 962 970 


thymus 


Clontech 


THMc02 


5-6 12 14 19-21 23-24 37 43-44 47 49-50 54 60 
65 70-71 85 87 95 99 108 110 116-117 119 122 
124 129 131 140 146 148 154 161 166 174 179 
193 197 201 204 206-207 213 220 222 235-236 
242 251-253 255-256 258 260-262 266-268 
270-272 278 281-288 296-298 300-301 326 341 
344 347_348 351 357 359-360 365-366 376 378 
388-389 393 406-408 415 439 444 447-448 466 
468 472 475-477 479 482 489 492-493 502-504 
521 533 543 546 548 550 556-558 560-564 568 
576 585-586 588 591 606 622 636-637 643-644 
650-656 670 673 709 718 738-742 758 773 776 
784 789 796 812 827 829 832 838 849-850 863- 
865 876 886 899 903-904 906 910 916 919-921 
929-930 940-941 948 952-953 958 970-971 


thyroid gland 

• 


Clontech 


THR001 


5 7 12 14 20-21 23-24 28 30 42 45-46 48 50-51 
58-59 61 65-66 70-71 76-80 83 89 93-94 96 98 
102 104 107-109 112 117-118 124 126 129 
134-136 140 144 147 153 159-161 165 167 
172-173 184-187 192 194 196-197 201 208 
215-216 218-220 223 225 235 246 248 253 
260-261 266 268-270 273 280 284 292 297-298 
302 304 310 315 321 330 334 336-337 344 348- 
349 351 353 365-366 378 387-388 392 401 404 
406-408 410-412 424 429 437 444 466-467 469 
472 475 479 485 487 491 500 519 533 540-544 
548 551 572 581-583 602 606 609 622 636 641- 
645 648-649 659 673 681 697 702 705-706 711 
715-716 740-743 754 757 764 769 772 787 789 
815 824 828-830 848 852 856 864-866 868-873 
875-876 879 882-883 885-891 902 904-906 908 
918-923 929-931 940 944-945 948 956 962 966 
971 


trachea 


Clontech 


TRC001 


7 48-50 59 75 79 85 1 17 122 153 166 173 185 
220 243 267 270 296 371-372 406 414 419 429 
432 467 492-493 501 569 573-574 582 697 
734-735 744 796 819 829 868-871 876 879 
890-892 903 


umbilical cord 


BioChain 


FUC001 


1 5-7 18 21 25 31 43 45-46 51 59 70 75 83 88- 

oq o/r no 1 AC i AO 1 HQ 11S 117 11R 1 90 199 1 ^ 

138 150 155 166-167 174 182 185 190-191 196 
200-201 218 220-221 223 228-229 235 241 254 
270 273 279 281 287 291 297 307 315 330 334 
336 342 349 353 363 365 372 406 410-412 423 
441 466-467 469 475 490-493 521 526 548 
551-552 561 584 611 622 629-630 637 645 
680-683 685 691 707 718 726 750 755-756 768 
775 800 831 838 844 855 878-881 883-885 893 
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Table 1 



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 








004 007 071 Q90-Q10 951 965 971 


uterus 


Clontech 


UTR001 


17 19 21 62 70 96 118 149 165-166 184 186 
188 235 248 264 279 300 353 371-372 423 466 
469 492-493 502 545-546 588 602 629 631 638 
648-649 673 751-752 829 832 864-865 891 
893-894 917 948 


young liver 


GIBCO 


ALV001 


2-4 10 21 38 40 51 70 76 113 117 147 202 243 
270-271 275-276 287 306 341 457 479 491 547 
637 662 867 906 921 970 



*The 16 tissue/mRNAs and their vendor sources are as follows: 1) Normal adult brain mRNA 
(Invitrogen), 2) Normal adult kidney mRNA (Invitrogen), 3) Normal fetal brain mRNA (Invitrogen), 4) Normal 
adult liver mRNA (Invitrogen), 5) Normal fetal kidney mRNA (Invitrogen), 6) Normal fetal liver mRNA 
(Invitrogen), 7) normal fetal skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) Human 
bone marrow mRNA (Clontech), 10) Human leukemia lymphoblastic mRNA (Clontech), 1 1) Human thymus 
mRNA (Clontech), 12) human lymph node mRNA (Clontech), 13) human so\spinal cord mRNA (Clontech), 
14) human thyroid mRNA (Clontech), 15) human esophagus mRNA (BioChain), 16) human conceptional 
umbilical cord mRNA (BioChain). 
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Table 2A 



SEQ 

JLJL) 


Hit ID 


Species 


Description 


S 


Percent 

IUC1IU 1/ Y 


972 


gi38223 


Pan troglodytes 


alphal-globin 


395 


100 


972 


gi38225 


Pan troglodytes 


alpha2-globin 


395 


100 


972 


gil433oool 


Homo sapiens 


lop 13.3 sequence section i oio. 


DyD 


i nn 

1UU 


973 


gil293093 


Mus musculus 


• 

argmase 


ooy 


OJ 


973 


gi202979 


Rattus 
norvegicus 


arginase 


853 


84 


973 


gi202981 


Rattus 
norvegicus 


liver arginase (E.C. 3.5.3.1) 


853 


84 


974 


gil 197498 


Homo sapiens 


H. sapiens arginase gene exon 1 and 
flanking regions (EC 3.5.3.1) (and 
joined CJJbj. 


1 1 CO 

1 1 JO 


1 nn 
1UU 


974 


A A "D AC7 A fZ 

AAKUjjOo 


T T * 

Homo sapiens 


Human argmase. 


I 1 

I I Jo 


1UU 


974 


gil78995 


Homo sapiens 


Human liver arginase rnRNA, 
complete cas. 


1155 


99 


975 


gil3529083 


Homo sapiens 


Similar to arginase, liver, clone 
MGC:12405 IMAGE: 3 934479, 
niKiNA, complete cas. 


778 


84 


975 


gi2047344 


Rattus 

• 

norvegicus 


arginase II 


497 


60 


975 


gi497232 


Xenopus laevis 


arginase 3 


486 


61 


976 


gil4211500 


Homo sapiens 


secretory protein SEC8 mRNA, 
complete cds. 


4912 


100 


976 


gil4042555 


Homo sapiens 


cDNA FLJ14782 fis, clone 
NT2RP4000524, highly similar to 
Mus musculus Sec 8 mRNA. 


4912 


100 


976 


AAB93175 


Homo sapiens 


Human protein sequence SEQ ID 

"X. T/""\ 1 O 1 1 /I 

NO:121 14. 


4912 


100 


977 


gil 1999092 


TT 

Homo sapiens 


TGF beta inducible nuclear protem 
lUNF 1 (llJNrl) mKJNA, complete 
cds. 


599 


91 


977 


gil4091475 


Homo sapiens 


hairy cell leukemia protein 1 
mKiNA, complete cas. 


599 


91 


977 


AAB39308 


Homo sapiens 


Human secreted protein sequence 
encoaea oy gene ho oii^ iu 
NO:188. 


599 


91 


y / o 


rri 1 1 A 
gil / 1H- 


iNepmid ciavipes 


aragime siik noroin 


ZUJ 




978 


gi4519619 


Haliotis discus 


collagen pro alpha-chain 


146 


28 




gl /O /UUDU 


A.enopus laevis 


type 1 collagen aipna 1 


1 &c\ 
1 ou 


9£ 
ZO 


979 


gil 657752 


Homo sapiens 


Human FE65-like protein (hFE65L) 
mKJNA, partial cds. 


3648 


96 


979 


AAY13459 


Homo sapiens 


Amino acid sequence of human 
Fe65-like protein. 


3648 


96 


979 


gil3377732 


Rattus 

« 

norvegicus 


FE65 


1495 


50 


980 


gil657752 


Homo sapiens 


Human FE65-like protem (lirE65L) 
mKJNA, partial cds. 


2Ui / 


/ J 


you 


A AV1 1A$Q 


XiVJllAU odJJlCllo 


A m inn n^irl cpnnPTipp r»"f rnim3T1 

Fe65-like protein. 


\j i 


/ ~j 


980 


gil 3377732 


Rattus 
norvegicus 


FE65 


1520 


46 


981 


gi458542 


Homo sapiens 


H.sapiens mRNA for orphan nuclear 
hormone receptor. 


437 


98 


981 


AAW32536 


Homo sapiens 


Constitutively active receptor-alpha. 


437 


98 


981 


AAW93902 


Homo sapiens 


Human CAR receptor protein. 


437 


98 


982 


gil5157720 


Agrobacterium 


AGR C 4595p 


208 


26 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


Percent 
identity 






tumefaciens 








982 


gi4520316 


Sphingomonas 
paucimobilis 


OH-DDVA meta-cleavage 
compound nyaroiase 


202 


25 


982 


gi8777583 


Sphingomonas 
paucimobilis 


4-oxalomesaconate hydratase 


191 


29 


983 


gil4043978 


Homo sapiens 


interferon stimulated gene (20kD), 
clone MCjC:141Uj 

IMAGE:4309120, rnRNA, complete 
cds. 


427 


98 


983 


gi6759541 


Homo sapiens 


H. sapiens mRNA for gpISG20 
protein. 


427 


98 


983 


gi2062680 


Homo sapiens 


Human HEM45 mRNA, complete 
cds. 


427 


98 


984 


gil2804251 


Homo sapiens 


clone MGQ2652 IMAGE:3535451, 

T*» \T A 1 i J 

mRNA, complete cds. 


500 


100 


984 


AAG02107 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: oloo. 


297 


98 


984 


AAB95667 


Homo sapiens 


Human protein sequence SEQ ID 

\rn 10A/IO 

NO:18448. 


71 


28 


985 


gi7023926 


Homo sapiens 


cDNA FLJ1 1336 fis, clone 
PLACE 1 010661, weakly similar to 
TESTIS-SPECIFIC PROTEIN 

"DO PI 1 


774 


50 






Homo sapiens 


Human protein sequence onv^ iu 
NO: 13262. 


HI A 
I /4 






glo4uo0lD 


Homo sapiens 


1 L/r 11 ^ 1 Cr 1 1 ) mKJN A, complete 
cds. 




40 




AAK./4ZIO 


Homo sapiens 


Human death associated protein 

A T> O 

IJAJr-Z. 


TOT 


z / 




A A XX T1 1 Q /^'7 

AA W / 130 1 


xiomo sapiens 


jjeatn associatea protem-z {Dj\r- 1 , 
DAP-kinase). 


ZO / 


z / 




gl2Uy4o 15 


xiomo sapiens 


xi. sapiens u/vr-Kinase miviN j\ . 


ZO / 


z / 


987 


gil2584159 


Homo sapiens 


zinc finger protein 268 (ZNF268) 
mKiNA, complete cas. 


1723 


54 


yo / 


i nom a ^ 
gllUZU14D 


Homo sapiens 


Human UlN A Dinamg piotem 
(HPF2) mRNA, complete cds. 


1 AQ7 
1 Oo / 




Vo / 


C/l A 1 1 C 
gO44l0lD 


cams iamiiiaris 


zinc linger protein 


1 AQ1 

i4y^ 




988 


gi895921 


Xenopus laevis 


nucleolar phosphoprotein 


152 


21 


no o 


^.,*/cn 1 no 1 

gioUlyi 1 


Oryctolagus 
cuniculus 


neurofilament-H 


1 A ^ 

14j 


zz 


no o 

988 


gi5901659 


Caenorhabditis 
elegans 


■vnvTTl 1 

XlMP-1 


1 A Q 

14o 


Z4 


n o n 

989 


AAY70539 


Homo sapiens 


Human Factor 8 Homologue. 


z 13 


3D 


989 


AAB64627 


Homo sapiens 


Human secreted protein BLAST 
searcn pro tern !SiiQ ID NO: 1J / . 


144 


29 


non 


AABo4ozo 


Homo sapiens 


Human secreted protein BLAST 


144 


zy 


990 


AAB56613 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1191. 


1120 


100 


990 


gil82642 


Homo sapiens 


Human rapamycin-binding protein 
(FKBP25) mRNA, complete cds. 


1109 


96 


990 


gil 82626 


Homo sapiens 


Human rapamycin binding protein 
(FK506) mRNA, complete cds. 


1061 


100 


991 


gi 127464 10 


Mus musculus 


coenzyme A diphosphatase 


357 


60 


991 


gil0764850 


Arabidopsis 


F1K23.5 


127 


33 
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SEQ 
ID 


Hit ID 


Species 


^^^^k mtm mm-+ _ ^ Am mm jTm, mm 

Description 


c 

score 


X CI CCll L 

identity 






. i 1 • 
thahana 








991 


gi5824849 


Caenorhabditis 
elegans 


Yo /CjzA.14 


i nn 


D 1 


992 


gil4348900 


Homo sapiens 


heat shock protein niRNA, complete 
cds. 


2122 


90 


992 


gi5 17065 


Homo sapiens 


Human chaperonin protein (Tcp20) 
gene complete cds. 


2122 


90 


992 


AAB56638 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 121 6. 


2122 


90 


993 


gi3298472 


Mus musculus 


zinc finger protein 


828 


91 


993 


gi4884374 


Homo sapiens 


mRNA; cDNA DKFZp586B1222 
(from clone DKFZp586B1222); 
partial cds. 


194 


42 


993 


gil254984 


Rattus 

• 

norvegicus 


AKAP95 


194 


42 


994 


gil4042531 


Homo sapiens 


cDNA FLJ14769 fis, clone 
NT2RP3004189, weakly similar to 
VEGETATIBLE 
INCOMPATIBILITY PROTEIN 
HET-E-1. 


3327 


99 


994 


gil4424653 


Homo sapiens 


clone MGC: 16757 
IMAGE:41ilozj, mKJNA, complete 
cds. 


3327 


99 


994 


AAB95791 


T T 

Homo sapiens 


Human protein sequence SEQ ID 
NO: 18751. 


DOZ 1 


QO 

yy 


995 


gi2736284 


Mus musculus 


Ldbla 


1^1 A 


QQ 

yy 


995 


gi2582522 


Gallus gallus 


neural src interacting protein, long 
form; NSIP long form 


2206 


98 


995 


gi5123791 


Homo sapiens 


partial NLI gene for Nuclear LIM 
interactor, exon 2-11. 


2175 


100 


997 


gi 13560797 


Homo sapiens 


ubiquitin specific protease mRNA, 
complete cds. 


Ool / 


1 nn 

1UU 


997 


gi37334 


Homo sapiens 


H. sapiens mRNA for tre oncogene 
(clone Z13). 


4510 


93 


1*"V /"V ^"1 

997 


gi5360127 


TT * 

Homo sapiens 


N Y-KbJN-oO antigen mKJN A, partial 
cds. 


A A OO 


QQ 

yy 


998 


AAB51702 


TT * 

Homo sapiens 


Human secreted protein sequence 
encoded by gene 32 SEQ ID 
NO: 142. 


d>y 1 


CO 


998 


gi5 070621 


Homo sapiens 


retrotransposon LI insertion in X- 
linked retinitis pigmentosa locus, 
complete sequence. 


364 


42 


998 


gi339776 


Homo sapiens 


Human transposon LI. 2. 


364 


42 


999 


gil5157181 


Agrobacterium 
tumefaciens 


AGR_C_3718p 


742 


51 


999 


gi 14022240 


Mesorhizobium 
loti 


probable D-lactate dehydrogenase 


734 


50 


999 


gil5075160 


Sinorhizobium 
meliloti 


PUTATIVE D-LACTATE 
DEHYDROGENASE 
(CYTOCHROME) PROTEIN 


713 


50 


1000 


gil4603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene, clone 
MGC: 19636 IMAGE: 2 8223 23, 
mRNA, complete cds. 


368 


91 


1000 


AAB36613 


Homo sapiens 


Human FLEXHT-35 protein 


368 


91 
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ID 


Hit ID 


bpecies 


jjescnption 


s 

score 


identity 








sequence SEQ ID NO:35. 






1000 


gi7022832 


TT ' 

Homo sapiens 


«T\XT A T7T T 1 A£ < 1 -fin /-i1/-*-n/a 

cDJNA rLJ lUool lis, clone 
NT2RP2006106. 


zzu 




1001 


gi4884l42 


TT „ _ * 

Homo sapiens 


mKJNA; CDJNA DifJfZjpDoorZZly 

(from clone DKFZp586P2219). 


ou / 


QQ 


1001 


gil342l40l 

* 


Caulobacter 
crescentus 


Mut 1 /nuaix lamily protem 


1 *X1 




1001 


gil3276687 


TT - _ „ 

Homo sapiens 


mKJNA; cDJNA DJsJ^Zp /Olll /z 
(from clone DKFZp761I172); 
complete cds. 


1 1 c 
110 


^ 1 

O 1 


1002 


gi4689229 


Rattus 
norvegicus 


b-tomosyn isoform 


2075 


94 


1002 


gi3790389 


Rattus 
norvegicus 


m-tomosyn 


1976 


95 


1002 


gi468923l 


Rattus 
norvegicus 


s-tomosyn isoform 


1896 


94 


1003 


AAB29645 


Homo sapiens 


Human membrane-associated 
pro tern Jriuivi/\jr-z. 


311 


83 


1003 


gil 196425 


Homo sapiens 


Human endogenous retrovirus 
xiJv v jj poi-env-j ju i region. 


309 


71 


1003 


AAB95392 


Homo sapiens 


Human protein sequence SEQ ID 

"MTV 1 I'lA'X 


273 


77 


1004 


AAY21631 


Homo sapiens 


Ligand binding domain of nuclear 
receptor niKoeta. 


524 


100 


1004 


gi31207 


Homo sapiens 


Human c-erb-A mRNA for thyroid 
normone receptor. 


497 


100 


1004 


gil80253 


Homo sapiens 


Human c-erbA mRNA, complete 
cds. 


497 


100 


1005 


gil3097597 


Homo sapiens 


clone IMAGE:3542589, mRNA, 
partial cas. 


2967 


100 


1005 


gi2370155 


Homo sapiens 


Homo Sapiens mRNA for 
spinocerebellar ataxia 7. 


1031 


37 


1005 


gi3 192954 


Homo sapiens 


ataxin-7 (SCA7) mRNA, complete 
cds. 


1029 


37 


1006 


gil3097597 


Homo sapiens 


clone IMAGE:3542589, mRNA, 
partial cas. 


1853 


99 


1006 


gi3 192954 


Homo sapiens 


ataxin-7 (SCA7) mRNA, complete 

COS. 


1270 


40 


1006 


gi2370155 


Homo sapiens 


Homo Sapiens mRNA for 
spinocerebellar ataxia 7. 


1268 


40 


1007 


gi5924306 


Plasmodium 

r 1 • 

ialciparum 


AD A2 -like protein 


410 


30 


1007 


gil70991 


Saccharomyces 
cerevisiae 


ADA2 


396 


31 


1007 


gi927705 


Saccharomyces 
cerevisiae 


Ada2p: probable transcriptional 
adaptor; YDR448W; CAI: 0.12 


390 


1 1 
31 


1 nno 
lUUo 


gliUj4 /4 / 


riomo sapiens 


n. sapiens uivir\, j_^ivijd, xiijA-/jij 
IPP2, LMP2, TAP1, LMP7, TAP2, 
DOB, DQB2 and RING8, 9, 13 and 
14 genes. 


952 

Zs ~J 


1 00 


1008 


gi38482 


Homo sapiens 


H.sapiens gene for major 
histocompatibility complex encoded 
proteasome subunit LMP7. 


952 


100 


1008 


gi596140 


Homo sapiens 


Human proteasome subunit LMP7 
(allele LMP7B) mRNA, complete 


952 


100 



WO 03/029271 



PCT/US02/30474 



133 
Table 2A 



SEQ 
ID 


Hit ID 


Species 


Description 


o 

£-m f\ 

score 


Jrerceni 
laeniiiy 








cds. 






1009 


gi9309471 


T T * 

Homo sapiens 


DNMT1 associated protein- 1 
(DMAP1) mRNA, complete cds. 


Zdoj 




1009 


gil0432820 


Homo sapiens 


cDNA FLJ1 1543 fis, clone 
HEMBA1002816. 


23oD 




1009 


gil2804007 


Homo sapiens 


clone MGC:3485 IMAGE:3635724, 
mRNA, complete cds. 


2385 


96 


1010 


gi6942267 


Danio rerio 


bHLH transcription factor Mesp-b 


80 


z / 


1010 


AAB95031 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16776. 


71 


26 


1010 


AAY87132 


Homo sapiens 


Human secreted protein sequence 
SEQ ID NO: 171. 


70 


51 


1011 


gi5815353 


Homo sapiens 


J domain containing protein 1 
isoform a (JDP1) mRNA, complete 
cds. 


782 


98 


1011 


gi6453811 


Mus musculus 


J domain protern 1 


548 


72 


1011 


gil4192847 


Rattus 
norvegicus 


J domain protein 1 


527 


69 


1012 


gi5777952 


Homo sapiens 


nuclear prelamrn A recognition 
factor mRNA, complete cds. 


1946 


99 


1012 


gi7021904 


FE 


cDNA FLJ 10067 fis, clone 
HEMBA1001526, weakly similar to 
PERIPLASMIC HYDRO GEN Abb 
1 (bC l.lo.yy.l). [Homo 


1946 


99 


1012 


AAB97260 


T T * 

Homo sapiens 


Human JNADF nydrogenase subumt 
50. 




QQ 

yy 


1013 


gi5917666 


Zea mays 


extensin-like protein 


All 


in 
z / 


1013 


gil5145797 


Sus scrofa 


basic proline-rich protein 


415 


29 


1013 


gi6523547 


Volvox carten f. 
nagariensis 


hydroxyproline-rich glycoprotein 
DZ-HRGP 


1 1 £. 


dL 


1014 


gi407468 


Mus musculus 


SEB4 


1 1 /z 


y3 


1014 


gi407419 


Homo sapiens 


H. sapiens seb4D mRNA. 


1141 


96 


1014 


gi40742 1 


Y" T • 

Homo sapiens 


TT • "1 A ~T~\ TI*V T A 

H. sapiens seb4B mRNA. 


1074 


yo 


1015 


gil2805043 


Homo sapiens 


clone IMAGE:3461487, mRNA, 
partial cds. 


2281 


100 


1015 


AAB75532 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 27 SEQ ID NO:86. 


680 


92 


1015 


gi9802536 


Arabidopsis 
thaliana 


F17L21.25 


247 


27 


1016 


gi7020516 


Homo sapiens 


cDNA FLJ20424 fis, clone 
KAT02627. 


1054 


99 


1016 


gil0438720 


Homo sapiens 


cDNA: FLJ22363 fis, clone 
HRC06574. 


1054 


99 


1016 


AAB43450 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO:895. 


779 


98 


1017 


gi 1043 8720 


Homo sapiens 


cDNA: FLJ22363 fis, clone 

tjd rn^sT/i 


1215 


100 


1017 


gi7020516 


Homo sapiens 


cDNA FLJ20424 fis, clone 
KAT02627. 


1212 


99 


1017 


AAB43450 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 895. 


644 


97 


1018 


gil2653673 


Homo sapiens 


tubulin, gamma 1, clone MGC:1593 
IMAGE:3345973, mRNA, complete 
cds. 


2320 


97 


1018 


gil 83703 


Homo sapiens 


Human gamma-tubulin mRNA, 


2311 


96 
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identity 








complete cds. 






1018 


gi4 115724 


Rattus 
norvegicus 


tubulin 




yo 


1019 


gil2653689 


Homo sapiens 


Similar to aspartyl-tRNA 
synthetase, clone MGC:1562 
IMA Orb: 3 344322, mKJNA, complete 
cds. 


2586 


100 


1019 


gil79l02 


TT ~ „ • „ _ „ 

Homo sapiens 


Human aspartyl-tKJNA syntnetase 
alpha-2 subunit mRNA, complete 
cds. 




OS 

yo 


1019 


gil4250408 


Mus musculus 


oimiiar to aspartyi-tKiN /\ syntnetase 


ZjZI 




1020 


AAB63116 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 39 SEQ ID 
NO: 126. 


2035 


99 


1020 


gil4039848 


Homo sapiens 


testes development-related NYD- 
Sri 9 mRNA, complete cds. 


1919 


99 


1020 


gil0437135 


Homo sapiens 


cDNA: FLJ21108 fis, clone 
CAISOdzo/. 


1144 


99 


1021 


gi60 15493 


Candida albicans 


proline transport helper PTH1 


345 


39 


1022 


gil0434528 


Homo sapiens 


T\\T A I - 'T T1 TO 1 zT XT! „ „ 1 „ 

cDNA FLJ12816 lis, clone 
NT2RP2002609, weakly similar to 
2-HYDROXYMUGQNIC 
SEMIALDEHYDE HYDROLASE 
(EC 3,1.1.-). 


1 oin 

loiy 


yy 


1022 


AAB94324 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14807. 


loiy 


yy 


1022 


gi4929585 


Homo sapiens 


CGI-58 protein mKJNA, complete 
cds. 


1 r\oi 
iUzi 


D4 


1023 


gil2803055 


Homo sapiens 


Clone M(jrC:4ol4 IJVLA.(aIi:3j)U4zoo, 
mRNA, complete cds. 


oo4 


1 nn 

1UU 


1023 


gi8978394 


Chlamydophila 
pneumoniae 

T1 1Q 


T "1 , ^ 

rmp j 




ZD 


1023 


A A Y3 9473 


Homo sapiens 


DNAX interferon-like receptor 
subunit 2 protein sequence. 


73 


33 


1024 


gil0433H5 


TT„ ■ „ 

Homo sapiens 


„r\\T A T7T T1 1 TOO rtlrt-o 

cDJNA rLJ 1 1 /oj lis, clone 
HEMBA 1006005. 


71 Q 
/ ly 




1024 


AAB93824 


Homo sapiens 


Human protein sequence oxi^ lu 
NO: 13634. 


i iy 


1 nn 


1024 


gil70454 


t ~ : ., 

Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


/U 




1025 


gil4042550 


Homo sapiens 


cDNA FLJ14779 fis, clone 
NI2RP40UQ398, moderately similar 
to ZINC FINGER PROTEIN 140. 


1372 


58 


1025 


AAB93164 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12091. 


1372 


58 




glioDOUooo 


jtiomo sapiens 


iiZjf 1 1 -reiaieQ pro icm 1 niru. > -r*., 
complete cds. 






1026 


gil 1386005 


Homo sapiens 


hepatocellular carcinoma-associated 
protein HCA10 mRNA, complete 
cds. 


2802 


100 


1026 


gil3021980 


Homo sapiens 


hepatocellular carcinoma-associated 
protein HCA1 1 mRNA, complete 
cds. 


2802 


100 


1026 


gil3623377 


Homo sapiens 


clone MGC: 10536 


2802 


100 
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IMAGE:3958645, mRNA, complete 
cds. 






1027 


gi6063527 


Homo sapiens 


COPS7a mRNA for cop9 complex 
suDunit /a, complete cas. 


1241 


96 


1027 


gi6606550 


Homo sapiens 


COP9 complex subunit 7a mRNA, 
complete cds. 


1241 


96 


1027 


gil0433901 


Homo sapiens 


cDNA FLJ12426 fis, clone 
MAMMA1UU3 11-3, nignly similar to 
Mus musculus COP9 complex 
suDunit /a ^ LAj.ro/aj iiikina. 


1241 


96 


1028 


AAY35909 


Homo sapiens 


Extended human secreted protein 
sequence, oriA^ ijj jnu. i jo. 


854 


77 


1028 


gi6063527 


Homo sapiens 


COPS7a mRNA for cop9 complex 
suDunit /a, complete cas. 


622 


80 


1028 


gi6606550 


Homo sapiens 


COP9 complex subunit 7a mRNA, 
complete cds. 


622 


80 


1029 


"1 f AO A 1 A A 

gil5030l44 


Mus musculus 


Similar to RlKbJN cJJJNA 
1700127B04 gene 


0 1 0 


oo 


1029 


gi45l26l3 


Arabidopsis 
thaliana 


F28K20.12 


132 


39 


1029 


gil 5073552 


Smorhizobium 
meliloti 


HYPO 1 Hb 1 iCAb FRO 1 KLJN 


/z 


AO. 


1030 


gil4290488 


Homo sapiens 


clone MGC: 171 10 
lMACjb:418oioj, mRJNA, complete 
cds. 


1787 


100 


1030 


gil381027 


Homo sapiens 


Human mRNA for 
phosphoribosypyrophosphate 
synthetase-associated protein 39, 
complete cds. 


1775 


99 


1030 


gi436779 


Rattus 
norvegicus 


pnospnoribosylpyropnospnate 
synthetase-associated protein (39 
icua ) 


1 /Oo 


OR 


1031 


gil 5 126730 


Homo sapiens 


clone MGC: 16794 
IJVlALn}: 0000004, rnKiNA, complete 
cds. 


1672 


100 


1031 


gil 043 8924 


Homo sapiens 


cjjjna: ri_.JZzjuy ns, cione 
HRC 11803. 


1 fif^k 

IUOj 




1031 


A A T"» 1 C ^ AO 

AAB35408 


Homo sapiens 


Human u /Luz / gene protein. 


H 1 Z 




1032 


AAB95299 


Homo sapiens 


Human protein sequence SEQ ID 
JNO: 1 


2238 


99 


1032 


AAB67054 


Homo sapiens 


Human immune response molecule 
(1M.UJN j protein iu inu. o. 


1417 


98 


1032 


gi2232116 


Mus musculus 


mutated immunoglobulin heavy 
chain 


65 


31 


1033 


AAY70242 


Homo sapiens 


Human RNA-associated protein-23 
(RNAAr-23). 


1595 


97 


1 


gl 3 Uj /U1j 




J? T> ]\/f T>?\J A ViinHincr Tirofpin f-rrv-rbn 

XVXVJ.VJL J-Vi. > x\. L'lllLilll ti LJXULCJJ.1 VJIV X Jr 

(GRY-RBP) mRNA, complete cds. 


1595 

A *S *J 

* 


97 


1033 


gi3694986 


Mus musculus 


RRM RNA binding protein GRY- 
RBP 


1564 


95 


1034 


gil3436062 


Homo sapiens 


zinc finger protein 144 (Mel- 18), 
clone MGC: 10336 
IMAGE:3841545, mRNA, complete 
cds. 


1805 


97 


1034 


gi285933 


Homo sapiens 


Human mRNA for Mel- 18 protein, 


1805 


97 
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complete cds. 






1034 


gi6012977 


Felis catus 


Bmi-1 protein 


1057 


61 


1035 


gi3335159 


Homo sapiens 


bestroplun (VMD2) mRNA, 
alternatively spliced product, 
complete cds. 


o4j 




1035 


gi35 11242 


Homo sapiens 


vitelliform macular dystrophy 
protein (VMD2) mRNA, complete 
cds. 


04 j 




1035 


gi3598876 


Homo sapiens 


vitelliform macular dystrophy 
protein (VMJJZj gene, exon i i ana 
complete cds. 


645 


59 


1036 


gil3543935 


T 1" ' 

Homo sapiens 


clone MLrC:12yoo 
IMAGE:3 139887, mRNA, complete 
cds. 


997 
ZZ / 


Oi? 


1036 


gil4250415 


Homo sapiens 


clone IMAGE:3868989, mRNA, 
partial cds. 


213 


66 


1036 


gil006682 


Neisseria 
meningitidis 


outer membrane protein P64k or 
PM-6 


54 


33 


1037 


AAY48501 

* 


Homo sapiens 


Human breast tumour-associated 
protein 46. 


987 


98 


1037 


gi6635999 


Homo sapiens 


cardiovascular heat shock protein 
mRNA, complete cds. 


869 


100 


1037 


gil3623439 


Homo sapiens 


heat shock 27kD protein family, 
member 7 (cardiovascular), clone 
MGC: 12642 IMAGE:4131506, 
mRNA, complete cds. 


869 


1UU 


1038 


AAG01992 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6073. 


296 


100 


1038 


gi5911915 


Homo sapiens 


mRNA; cDNA DKFZp586M0622 
(trom clone JJi^ZvpjooJVlUoZz ); 
partial cds. 


81 


32 


1038 


gil 181346 


Paramecium 
bursaria 

Chlorella virus 1 


alo3.L 


00 




1039 


gil2655013 


Homo sapiens 


clone MGC:3036 IMAGE:3 163787, 
mKJNA, complete cas. 


1353 


100 


1039 


gil 04405 10 


Homo sapiens 


mRNA for FLJ00101 protein, partial 
cds. 


141 


35 


1039 


gi8132056 


Trypanosoma 
brucei 


phosphatase 


137 


30 


1040 


gil2655013 


Homo sapiens 


clone MGC: 3 03 6 IMAGE:3 163757, 
mRNA, complete cds. 




1UU 


1040 


gil 04405 10 


Homo sapiens 


mRNA for FLJ00101 protein, partial 
cds. 


141 


35 


1040 


gi8 132056 


Trypanosoma 
brucei 


phosphatase 


137 


30 


1041 


gil265zo49 


Homo sapiens 


riDOSomai piotem .LrZo, cione 
MGC:3230 IMAGE: 3 5 040 15, 
mRNA, complete cds. 






1041 


gil4603452 


Homo sapiens 


clone MGC:20081 

IMAGE:4054251, mRNA, complete 
cds. 


557 


98 


1041 


gil 5079503 


Homo sapiens 


clone MGQ20378 

IMAGE:4561118, mRNA, complete 
cds. 


557 


98 
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irl ail t\7 

laeniiiy 


1042 


AAW95630 


Homo sapiens 


Homo sapiens secreted protein gene 
clone gnl 14 1. 


358 


100 


1042 


AAG02785 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6866. 


245 




1042 


gi467292 


Dictyostelium 

mm « • -J 

discoideum 


glutamine-asparagine rich protein 


231 


24 


1043 


gi3 06809 


Homo sapiens 


Human glutathione S-transferase 2 
(GST) mRNA, complete cds. 


859 


99 


1043 


gi306810 


Homo sapiens 


Human glutathione S-transferase 
(GST) Ha subunit 1 mRNA, 
complete cds. 


859 


99 


1043 


gi306815 


Homo sapiens 


Human glutathione S-transferase 
(GST) a-subunit mRNA, complete 
cds. 


o c c\ 

859 


no 

99 


1044 


gi825605 


Homo sapiens 


H. sapiens GSTalpha gene for 
glutathione S-tranferase exon 2. 


534 


96 


1044 


gil 2804085 


Homo sapiens 


Similar to glutathione S-transferase 
A2, clone MGC: 10525 
IMAGE:3943417, mRNA, complete 
cds. 


534 


96 


1044 


gi306811 


Homo sapiens 


Human glutathione S-transferase Ha 
subunit 2 (GST) mRNA, complete 
cds. 


534 


96 


1045 


gi306809 


Homo sapiens 


Human glutathione S-transferase 2 
(GST) mRNA, complete cds. 


702 


100 


1045 


gi306810 


Homo sapiens 


Human glutathione S-transferase 
(GST) Ha subumt 1 mRNA, 
complete cds. 


702 


100 


1045 


gi306815 


Homo sapiens 


Human glutathione S-transferase 
(GST) a-subunit mRNA, complete 
cds. 


702 


100 


1046 


gil0438279 


T "T" * 

Homo sapiens 


cDNA: FLJ22029 fis, clone 
HEP08661. 


1071 


100 


1046 


gi 13097624 


Homo sapiens 


clone IMAGE:3 608084, mRNA, 
partial cds. 


615 


60 


1046 


gil3325154 


IT TT * 

Homo sapiens 


1 T"]\ Jf A / — IT" > O C~l t\C\ T"> 7v T A 

clone IMAGE: 3 635709, mRNA, 
partial cds. 


ouy 


A 1 

4 / 


-4 A mm mm 

1047 


gil755138 


Homo sapiens 


sorbitol dehydrogenase (SORD) 
gene, exons 8 and 9, and complete 
cds. 


1000 


1 An 
1UU 


1047 


gi520450 


Homo sapiens 


sorbitol dehydrogenase gene, 
complete cds. 


1228 


100 


1047 


gi496078 


Homo sapiens 


Human L-iditol-2 dehydrogenase 
mRNA, complete cds. 


1221 


99 


1048 


AAB58331 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ED 669. 


478 


97 


1 (\A Q 




xiomo sapiens 


jTLUiiiaii bcorcieu. proreiii, oiiy wu 
NO: 5850. 


61 




1049 


gil3276623 


Homo sapiens 


mRNA; cDNA DKFZp761B1514 
(from clone DKFZp761B1514). 


1633 


100 


1049 


gil4042525 

• 


Homo sapiens 


cDNA FLJ14766 fis, clone 
NT2RP3004041. 


1626 


99 


1049 


AAB95783 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18735. 


1626 


99 


1050 


gil5011995 


Homo sapiens 


clone MGC: 13350 


943 


95 
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IMAGE:4333615, rnRNA, complete 
cds. 






1050 


AAB56131 


Homo sapiens 


Human secreted protein sequence 
encoded by gene jj MiC<J iu 
NO:225. 


943 


95 


1050 


gil2052900 


Homo sapiens 


mRNA; cDNA DrLrZpDo4(jrUo /o 
(from clone DKFZp564G0678); 
complete cds. 






1051 


gil0435735 


Homo sapiens 


cDNA FLJ13657 fis, clone 

"T»T A /"t'l"? 1 A 1 1 C /TO 

PLACE101 1563. 


1819 


99 


1051 


AAB94709 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15705. 


1 Q1Q 


QQ 


1051 


gil2043689 


Plasmodium 
falciparum 


merozoite surface protein 


1 1 c 

1 1 J 




1052 


gil 2654509 


Homo sapiens 


Similar to SH3 and FX domain- 
containing protein SH3PX1, clone 
IMAGE: 295 84 15, mKJNA, partial 
cds. 






1052 


gi54 10249 


Homo sapiens 


SDP1 protein mRNA, complete cds. 


259 


42 


1052 


gi4689258 


Homo sapiens 


sorting nexin 9 (SNX9) mRNA, 
complete cds. 






1053 


gil 5074372 


Sinorhizobium 
meliloti 


HYPOTHETICAL 
TRANSMEMBRANE PROTEIN 


102 


39 


1053 


gi6002569 


Coturmx 
japonica 


POU-box protem brain-2 




1 Q 
DO 


1053 


gil3623799 


Homo sapiens 


FZD8 mRNA for seven- 
transmembrane receptor Frizzled- 8, 
complete cds. 


9o 


4U 


1054 


gi28384 


Homo sapiens 


H.sapiens ADE2H1 mRNA showing 
homologies to SAICAR synthetase 
and AIR carboxylase of the purme 
pathway (EC 6.3.2.6, EC 4.1.1.21). 


1372 


99 


1054 


gi976252 


Rattus 
norvegicus 


A TT* 1 1 CI A T /"I A T* 

AIR carboxylase-SAICAR 
synthetase 


1334 


yj 


1054 


gi211194 

• 


11 1 "f 

Gallus gallus 


r* ..... | '-t 1 j. * J 

5-ammoimidazole ribonucleotide 
(AIR) carboxylase -5- 
ammoimidazole-4-N- 
succinocarboxamide ribonucleotide 
(SAICAR) synthetase 


1248 


o o 


1055 


AAG01499 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5580. 


293 


100 


1055 


gi9188555 


Chlorella 
vulgaris 


Homolog ot cmcK neat snook: iactor 
3 


AH 
OU 




1055 


gi3098309 


Crithidia 
fasciculata 


HI histone-like protein p21 


74 


26 


1056 


gi3 115347 


Homo sapiens 


BAC clone CTA-27603 from 7q22- 

1 rorrrnlete cjeniieiice 


121 


29 


1056 


gil 770454 


Homo sapiens 


H.sapiens mRNA for M-phase 
phosphoprotein, mpp 1 1 . 


121 


29 


1056 


gi3 13202 


Drosophila hydei 


mstl01(2) 


123 


24 


1057 


gil4550467 


Homo sapiens 


spermatogenesis associated 2, clone 
MGC:15312 IMAGE: 3 95 3 522, 
mRNA, complete cds. 


184 


26 


1057 


AAB95348 


Homo sapiens 


Human protein sequence SEQ ID 
NO:17635. 


184 


26 
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1057 


gi4730927 


Homo sapiens 


spermatogenesis associated PD1 
mRNA, complete cds. 


181 


26 




gil3544063 


Homo sapiens 


clone MGC:13219 
IMAGE:3959259, mRNA, complete 
cds. 


254 


54 




AAB28631 


Homo sapiens 


Human breast tumour-specific 
antigen B21GT2. 


220 


49 




gi8272464 


Homo sapiens 


human endogenous retrovirus W 
gagC3.37 G gag (gag) gene, 
complete cds. 


202 


4o 


1058 


gil4334374 


Homo sapiens 


leucine zipper protein AF5 alpha 
mRNA, complete cds. 


649 


74 


1058 


gil3529293 


Homo sapiens 


Similar to leucme zipper protem 
FKSG14, clone MGC: 12540 
IMAGE:3839409, mRNA, complete 
cds. 


649 


>-l A 

74 


1058 


gil4250169 


Homo sapiens 


Similar to leucme zipper protem 
FKSG14, clone MGC: 14847 
I3VLAGE:35 11065, mRNA, complete 
cds. 


649 


74 


1059 


AAB62180 


Homo sapiens 


Human p95 protem. 


2395 


on 

&y 


1059 


gil5080468 

• 


Homo sapiens 


Similar to RIKEN cDNA 
1110018J12 gene, clone 
IMAGE:3865164, mRNA, partial 
cds. 


660 


100 

- 


1059 


gil65520 


Oryctolagus 
cumculus 


smooth muscle myosin heavy chain 


144 


24 


1060 


gil3436353 


Homo sapiens 


clone IMAGE:3632533, mRNA, 
partial cds. 


166 


35 


1060 


gil3186114 


Homo sapiens 


mRNA for rab interacting lysosomal 
protein (RILP gene). 


166 


35 


1060 


AAG02093 


Homo sapiens 


Human secreted protem, SEQ ID 
NO: 6174. 


14U 


4z 


1061 


gil0440398 


Homo sapiens 


mRNA for FLJ00032 protem, partial 
cds. 


Oil o 

222o 


OU 


1061 


gil3752754 


Homo sapiens 


zinc ringer 1 1 1 1 mRNA, complete 
cds. 


zloo 


~>y 


1061 


gil 86774 


Homo sapiens 


Human Kruppel related zinc finger 
protem (HTF10) mRNA, complete 
cds. 


2130 


52 


1062 


AAY 17227 


Homo sapiens 


Human secreted protein (clone yal- 

i). 


1 A H £L 
14 /O 


yy 


1062 


gil2831176 


Agelaius 
phoeniceus 


gamma filamm protem 


183 


26 


1062 


AAY94938 


Homo sapiens 


Human secreted protem clone 
ye78_l protein sequence SEQ ID 
JNU:oZ. 


169 


25 


1063 


gil 4042 641 


Homo sapiens 


cDNA FLJ14834 fis, clone 
OVARC 100 1270. 


560 


100 


1063 


AAB94558 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15328. 


560 


100 


1063 


gi246315 


Mus sp. 


immunoglobulin lambda light chain 
variable region 


58 


39 


1064 


gil3752754 


Homo sapiens 


zinc finger 1111 mRNA, complete 
cds. 


2304 


58 
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1064 


gil0440398 


Homo sapiens 


mRNA for FLJ00032 protein, partial 
cds. 


2294 


59 


1064 


gil 0436789 


Homo sapiens 


cJJJNA ri-rJ 1434D lis, cione 
THYRO 1001 189, weakly similar to 
ZINC FINGER PROTEIN 91. 




SO 


1065 


gi2226004 


XT - • 

Homo sapiens 


Human liggerl txansposaDie 
element, complete consensus 
sequence. 


JJO 


^0 


1065 


gil434936S 


Homo sapiens 


Clone lVL*orLx. loooD 

IMAGE:3937996, mRNA, complete 
cus. 






1065 


AAB39379 


Homo sapiens 


Human secreted protein sequence 
cntoucu oy gene z, i oxjv^/ 
NO:128. 


200 


38 


1 f\/Z£ 

lUoo 


glzo4DUoo 


riyuia vulgaris 


coiidgcn 


86 


39 


1066 


gi555157 


Human 

l^am^ AHX 711*1 lfl /I 

nerpesvirus 4 


nuclear antigen 1 


68 


39 


1066 


AAW99376 


Homo sapiens 


Human fibroblast growth factor 2 24 
kD isoformN-terminus, 


67 


33 


1067 


gl7453o41 


urosopmia 
melanogaster 


casein Kinase n Dcta suduiul 




^7 


1068 


gi7022973 


Homo sapiens 


cDNA FLJ10749 fis, clone 
NT2RP3001915. 


210 


32 


1068 


AAB93129 


Homo sapiens 


xluman protem sequence oJd^ i-L' 
NO:12017. 




19 

JZ 


1068 


gil2053201 


TT _ „^ „ • ^ 

Homo sapiens 


rnKJNA; CUJNA JJJvrZ^p4o4AlUJl 

^rrom cione i^jsjrz^p'+j^f/v.iu^ i ^, 
complete cds. 


91 n 


19 

JZ 


1069 


gil4042847 


Homo sapiens 


cjjjna r jlj 14VD / us, cione 
PLACE4000009, weakly similar to 
MYOSIN HEAVY CHAIN, 
NONMUSCLE TYPE B. 




OQ 


1069 




Homo sapiens 


xluman protein sequence oxiv^ ijj 
NO: 18167. 




QQ 


1069 


gloiUo 1 / 0 


riomo sapiens 


cingunn miviN/\, complete cab. 


1 09 1 

1 UZ 1 


44 


1070 


AAB47134 


Homo sapiens 


CDIFF-15, Incyte ID No. 
jh 1 oj / 1 y^u 1 . 


3992 


99 


1070 


gi7022185 


Homo sapiens 


cDNA FLJ10260 fis, clone 
ririivirSo 1 uuuy / j , moaerateiy 
similar to Mus musculus schlafen3 

Iliivl > s\ . 


1182 


46 


1070 


AAB92636 


Homo sapiens 


Human protein sequence SEQ ID 
*Nrn« 1 oqs 1 


1182 


46 


1 C\H 1 
1U / 1 


~: i 70/: 1 0 
gll /ZO 1Z 


oaccnaromyces 
cerevisiae 




172 


36 


1 m 1 
1U/1 


gll /zoy / 


oaccnaromyces 
cerevisiae 

VVJl V ▼ -*■ ^ w 


olxxvl 


1 79 


36 


1071 


gil230657 


Saccharomyces 
cerevisiae 


Ssdlp 


172 


36 


1072 


gil 043 8029 


Homo sapiens 


cDNA: FLJ21841 fis, clone 
HEP01831. 


2194 


99 


1072 


gi35019 


Homo sapiens 


H.sapiens nestin gene. 


1642 


78 


1072 


AAR60127 


Homo sapiens 


Human nestin protein is useful to 
identify brain tumours. 


1642 


78 


1073 


AAY91424 


Homo sapiens 


Human secreted protein sequence 


997 


96 
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encoded by gene 12 SEQ ID 
NO:145. 






1073 


AAY41495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 70. 


1 n<r 
IUj 




1073 


gi3879551 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PrOliyl (Collagen triple neiix 
repeat (20 copies)), Score=56.4, E- 

im i, 1Q n XT — 9 ' "DTTHI ASIA 

value— ze-io, in— z, rrui'+o^ 
(Nematode cuticle collagen N- 
xcniiiiiai Qornaiiij 3 ocore — o/.z ? 
value=l.le-22,N=l 


1 AQ 

lUo 




1074 


gi2Q5278 


Kattus 
norvegicus 


male germ ceii-associaieu Kinase 
(mak) 


ZIOO 


RO 


1074 


gi53914 


Mus musculus 


rck 




70 


1074 


gil2002678 


Homo sapiens 


JVLAJv-reiatea Kinase (i^ivz ) 
rntviNA, complete cas. 


1 1 S? 


t / 


1075 


AAG03354 


Homo sapiens 


Human secreted protein, SEQ ID 

lNL/. 


373 


100 


1075 


gi25H438 


Homo sapiens 


calcium/calmodulin-dependent 
piotem Kinase 11 mi\j.N/\, pdniai out>. 


71 


30 


1075 


AAB95290 


Homo sapiens 


Human protein sequence SEQ ID 

IN \J . 1 / DLL. 


66 


42 


1076 


gi / j /S jyj 


ivius museums 


teasnin z 






1076 


gi7527470 


Mus musculus 


zinc finger protein 


1603 


45 


1076 


gi3 170196 


Homo sapiens 


antigen in x-lu-jj (in i-^v^-j>j^ 
mrsJ.N/\, complete cub. 






1 AT7 
10 / / 


gli4U.5UZo 


L^aiotes 

VCJLolOUlUI 




1 50 


32 


1077 


gil621501 


Homo sapiens 


Human REST protein mRNA, 


145 


32 


1077 


AAR99365 


Homo sapiens 


Human REST protein. 


145 


32 


1 m o 


aa y _>yooz 


riomo sapiens 


OCClCietl piULClll 1UO"uU7 J V 

FL. 




99 


1078 


AAY01635 


Homo sapiens 


Human PS214 derived polypeptide. 


951 


99 


1078 


AAY64650 


Homo sapiens 


rluman luman nomoiogy pro tern. 


yD l 


00 

yy 


1079 


gil2002058 


Homo sapiens 


p5326 mRNA, complete cds. 


1339 


100 


1079 


AAB36595 


Homo sapiens 


Human FLEXHT-17 protein 
sequence orLv^ ID JNvJ:l / . 


1339 


100 


1079 


gil3325192 


Homo sapiens 


clone IMAGE:3 640823, mRNA, 
partial cas. 


1302 


100 


1080 


gi 1404 1646 


Homo sapiens 


mRNA for OTT-MAL protein 
resulting from variant ( 1 ;22) 
translocation breakpoint. 


468 


42 


1080 


gil4161367 


Homo sapiens 


megakaryoblastic leukemia- 1 
protein (MKLl) mRNA, complete 
cds. 


468 


42 


1 HQfk 
lUoU 


gl LHKJH^ lOlo 


jnomo SapiCIlb 


T~nT? Tsl A fnr mpoararvnr^ir apnfp 
iiiJXiN^rv iui xxie^dL/<xi y*J\sy civ-/ lilv^ 

leukemia (MAL gene). 


468 


42 

I 


1081 


gi7020770 


Homo sapiens 


cDNA FLJ20571 fis, clone 
REC01040. 


746 


88 


1081 


gil2653983 


Homo sapiens 


HSPC142 protein, clone MGC:774 
IMAGE:3504488, mRNA, complete 
cds. 


746 


88 


1081 


gil3623287 


Homo sapiens 


HSPC142 protein, clone 
MGC: 11295 IMAGE: 3 947341, 


746 


88 
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mRNA, complete cds. 






1082 


AAB75321 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 34 SEQ ID 
NO: 140. 


507 


100 


1082 


gi3875721 


Caenorhabditis 
elegans 


F11C1.1 


155 


32 


1082 


gi8 17987 


Mus musculus 


neural cell adhesion molecule 
NCAM-140 andNCAM-180 


77 


41 


1083 


gil0433795 


Homo sapiens 


cDNA FLJ 12343 fis, clone 
MAMMA1 002292. 


2919 


100 


1083 


AAB93956 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14000. 


2919 


100 


1083 


AAY86552 


Homo sapiens 


Human gene 79-encoded protein 
fragment, SEQ ID NO:469. 


807 


87 


1084 


gi6855513 


Gallus gallus 


syndesmos 


552 


63 


1084 


gi 13 623 247 


Homo sapiens 


Similar to RIKEN cDNA 
1110001K21 gene, clone 
MGC: 11275 IMAGE:3944355, 
mRNA, complete cds. 


533 


61 


1084 


gil 5 157405 


Agrobacterium 
tumefaciens 


AGR_C_4093p 


86 


36 


1085 


gil 0436269 


Homo sapiens 


cDNA FLJ 13 95 6 lis, clone 
Y79AA1001185. 


s~ p—v 

672 


100 


1085 


AAB94844 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16020. 


672 


100 


1085 


AAG01524 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5605. 


72 


36 


1086 


gil2659138 


Mus musculus 


mage-d3 


149 


31 


1086 


gil3123516 


Mus musculus 


magphinm-gamma 


141 


30 


1086 


gil3123514 


Mus musculus 


magphinin-beta 2 


141 


30 


1087 


gi2695659 


Bos taurus 


pyruvate dehydrogenase 
phosphatase regulatory subunit 
precursor; PDPr 


946 


80 


1087 


gil4022085 


Mesorhizobium 
loti 


sarcosine dehydrogenase 


347 


37 


1087 


gil 5075297 


Sinorhizobium 
meliloti 


PUTATIVE OXIDOREDUCTASE 
PROTEIN 


344 


39 


1088 


gil 0440476 


Homo sapiens 


mRNA for FLJ00075 protein, partial 
cds. 


843 


98 


1088 


AAY35936 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 185. 


395 


100 


1088 


AAB64417 


Homo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 
INTRA49. 


360 


100 


1089 


gi538413 


Mus musculus 


zinc finger protein 


3113 


94 


1089 


gil 86774 


Homo sapiens 


Human Kruppel related zinc finger 
protein (HTF10) mRNA, complete 
cds. 


2466 


57 


1089 


gi6088100 


Homo sapiens 


mRNA for zinc finger protein 
(ZFD25), complete cds. 


2096 


52 


1090 


gil4719305 


Mus musculus 


SNX20 


1242 


77 


1090 


gil4719307 


Homo sapiens 


SNX21 (SNX21) mRNA, complete 
cds. 


404 


37 


1090 


gil 4 149068 


Homo sapiens 


Novel human gene mapping to 
chomosome 20. 


404 


37 
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1U V* it ti iy 


1091 


gil4597912 


Homo sapiens 


human CLASP-3 


6116 


99 


1091 


gll459oU3 / 


rlomo sapiens 


numan v^jlaoi - / 




71 


1091 


gil0440381 


Homo sapiens 


mRNA for FLJ00026 protein, partial 
cds. 


3616 


60 


1092 


gi2425 111 


Dictyostelium 
discoideum 


ZlpA 


1 AS 
lOo 


9 1 
Z 1 


1092 


gil67835 


Dictyostelium 
discoideum 


myosin heavy chain 


105 


20 


1092 


gi3044185 


Plasmodium 
falciparum 


mature parasite-infected erythrocyte 
suriace antigen 


127 


22 


1093 


AAB63245 


Homo sapiens 


Human breast cancer associated 
antigen protein sequence b.bQ ID 
NO:607. 


1775 


100 


1093 


gil5079591 


Homo sapiens 


Similar to cyclin D3, clone 
MGC: 10400 IMAGE:3946312, 
mKJN A, complete cas. 


1 <T 1 A 

ID 14 


1 AA 
1UU 


1093 


gil81247 


Homo sapiens 


cyclin D3 (CCND3) mRNA, 
complete cds. 


1514 


100 


1094 


gi8926320 


Rattus 
norvegicus 


corneal wound neaimg related 
protein 


DOVy 


7J 


1 AH A 

1094 


gi 1043 7745 


Homo sapiens 


cJJJNA: ri^J/1013 us, clone 
COL07381. 


ZODD 


OR 


1094 


gil0439113 


Homo sapiens 


_-r\\T A . T7T TT)/C/tl -fie «1ayio 

cJJJNA: ri^JZ/o4j lis, clone 
HSI07031. 


I 1 1 A 

I I 10 


1 CiC\ 
1UU 


1095 


gi7023247 


Homo sapiens 


nTV\T A "DT T 1 AO AO -Fin nlAna 

cDJNA r Lj luyUo lis, clone 
OVARC 1000087, weakly similar to 

rilo 1 VJiNli IVlAv^lvvJ-x iZA. 1 . 


Oil fi 


QQ 


1095 


AAB93319 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12406. 


2116 


99 


1095 


gil0433992 


Homo sapiens 


cJJJNA ri^J 1z4oU lis, clone 
NT2RM1001066. 


ZZZ 1 


1UU 


1096 


AAB43434 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO:879. 






1096 


AAB57205 


Homo sapiens 


Human prostate* cancer antigen 
protein sequence b.bQ 1JJ JNU. 1 /qd. 


459 


81 


1096 


gi 13097207 


Homo sapiens 


ribosomal protein, large, PI, clone 
MCjC:5215 lMACjJb:29Uuo4o, 
mRNA, complete cds. 


452 


84 


1097 


gill 138036 


Homo sapiens 


mRNA, similar to rat myomegalin, 
complete cds. 


885 

i 


100 


1097 


gil3543594 


Homo sapiens 


similar to rat myomegalin, clone 
MGC: 14586 IMAGE: 424743 3, 
mKJN A, complete cas. 




AO 


1097 


gill 138042 


Homo sapiens 


mRNA, similar to rat myomegalin, 
complete cds. 


833 


97 


1098 


gi7332088 


Caenorhabditis 
eiegans 


contains similarity to Pfam family 
rr\j i~>y l ^i^oiiagen mpic hciia 
repeat (20 copies)), score=73.8, 
E=3.5e-18, N-2 


98 


36 


1098 


gi4218141 


Antirrhinum 
majus 


HMR1 protein 


96 


34 


1098 


gil56262 


Caenorhabditis 
eiegans 


collagen 


93 


35 


1099 


gi 12224960 


Homo sapiens 


mRNA; cDNA DKFZp547J036 
(from clone DKFZp547J036). 


1336 


100 
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1099 


gil5145797 


Sus scrofa 


basic proline-rich protein 


143 


32 


1099 


gi904359 


Beta vulgaris 


cnitmase 1 


1 ^7 


d y 


1100 


gi8163691 


Streptococcus 
pneumoniae 


suriace piotein rspd 


1 1 n 

1 1U 




1100 


gi433475 


Drosophila 
melanogaster 


protein 34-specific exons 


97 


24 


1100 


gil58694 


Drosophila 
melanogaster 


tropomyosin isoform 34 (9B) 


86 


34 


1101 


gi467067 


Mycobacterium 
leprae 


B2168_F1__37 


100 


27 


1101 


gil296947 


Tentyria schaumi 


cytochrome oxidase I 


68 


42 


1101 


gil296883 


Hegeter glaber 


cytochrome oxiaase i 




40 


1102 


gil0435272 


Homo sapiens 


cDNA FLJ13305 fis 5 clone 

U V AK.L.lUUl z>yy. 


2297 


99 


1102 


AAB94564 


Homo sapiens 


Human protein sequence oxii^ ru 
NO:15341. 


9907 

ZZJ7 / 


yy 


1102 


gi2947228 


Plasmodium 
yoelii yoelii 


erytnrocyte Dinamg protem 


1 fid 


1 Q 
i y 


1103 


gil3276571 


Homo sapiens 


partial mRNA for chrlO 
synaptotagmm (CrlKlUbi 1 genej. 


824 


98 


1103 


gil932801 


Rattus 
norvegicus 


synaptotagmin X 


381 


35 


1103 


gi6136792 


Mus museums 


synaptotagmm X 






1104 


AAY73389 


Homo sapiens 


HTRM clone 4173 1 1 1 protein 
sequence. 


1023 


100 


1104 


AAB32463 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 22 SEQ ID 
NO:149. 


in A 


1UU 


1104 


AAB32465 


Homo sapiens 


Human secreted protein sequence 
encoded by gene zz b±i<s2 m 
JNU: ID 1. 


235 


100 


1105 


gi 13 543 642 


Homo sapiens 


clone MGC: 14625 
lJVlAijrii:4U /Ooiu, mKJN/\, complete 
cds. 


561 


89 


1105 


•it) 1 or 

gi2343l85 


Homo sapiens 


tubulin iolaing coiactor r> mKJ.N/\, 
complete cos. 


jou 


00 


1105 


AAW47209 


Homo sapiens 


Homo sapiens tubulin-folding 
coiactor d. 


560 


90 


1106 


gi3790545 


Mus musculus 


neuronal protein 4.1 


3665 


95 


1106 


gi4587H8 


Rattus 
norvegicus 


rat bram 4.1^; 




01 


1106 


gi4587l20 


Rattus 
norvegicus 


rat brain4.1(L) 


2882 


79 


1107 


gi3790545 


Mus musculus 


neuronal protein 4.1 


4367 


95 


1107 


gi4587ll8 


Rattus 
norvegicus 


rat brain4.1(S) 


4361 


95 


1 1 n*7 
1 1U / 


sin 1 on 
gl4Do / 1ZU 


"D nfhlO 

K.attus 
norvegicus 


lal Drain 


2882 

■iw U v_J Z^ 


79 


1108 


gi5912239 


Homo sapiens 


mRNA; cDNA DKFZp4340225 
(from clone DKFZp4340225); 
partial cds. 


290 


34 


1108 


gi5912034 


Homo sapiens 


mRNA; cDNA DKFZp434N0535 
(from clone DKFZp434N0535); 
partial cds. 


290 


34 


1108 


gi7023228 


Homo sapiens 


cDNA FLJ10898 fis, clone 


230 


48 
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IN IzKJOUUj4yz. 






1109 


gil0179839 


: 

Homo sapiens 


l ny- 1 glycoprotein ana i ny- 1 oo- 
transcribed protein mRNAs, 
complete cas. 


761 


100 

X \J\J 


1109 


gi2944066 


Canis familiaris 


synapsin I 


110 


33 


1109 


gi530876 


Chlamydomonas 
reinhardtii 


ammo acid leature. Koa proteni 
domain, aa 266 468; amino acid 
leature; gioouiar proieui aomam, aa 
32 265 


QQ 


z*y 


mo 


gil51442 /I 


Homo sapiens 


IN I JJ-Oi 1 1 mjxiN/Y, oompicLc out>. 


90S? 


61 


1110 


gllOl /OZSl 


Arabidopsis 
thaliana 


Katanm pou suDuniL-iiKe proicm 


1 ^0 


25 


1 1 10 


gil4578567 


bcnizosaccnarom 
yces pombe 


omci Kinase omamg pioiem ij 




20 


1111 

mi 


gil5144z /3 


Homo sapiens 


proiein Kinase in i orz, i + iiu\_ln^\. 5 
complete cds. 


1 


100 


1111 
mi 


gil04377Uz 


Homo sapiens 


CUiN/Y. JT J-* J Z. ID /y lib, t/lUllc 

COL06761. 


141 S 


100 

X \J\J 


1111 

llll 


gll4647541 


z,apus nuasomus 


growtn normone leccpiui 




37 


1112 


AAB95856 


Homo sapiens 


Human protein sequence SEQ ID 
xrrv 1 sol A 


109 


28 


1112 


gi3885945 


Omithorhynchus 
anatmus 


amelogenin 


94 


28 


1112 


gi3885947 


Tachyglossus 
aculeatus 


amelogenin 


94 


29 


1113 


gil0433312 


Homo sapiens 


cDNA FLJ1 1939 fis, clone 

HJilvlxSrS lUUUDVZ. 


1110 


100 


lll3 


AAB93872 


Homo sapiens 


Human protein sequence SEQ ID 

INvJ. ID III. 


1110 


100 


lll3 


gi560496 


Cercopithecine 

i * 1 
nerpesvirus 1 


glycoprotein G (homologue of HSV- 

Z Uo^+J 


109 


34 


lll4 


gil0433986 


Homo sapiens 


cDNA FLJ 12476 fis, clone 
NT2RM1000978. 


3253 


100 


*l *1 4 j4 

1114 


AAB94049 


Homo sapiens 


rluman protein sequence ori/V,/ ijj 
NO:14213. 




1 00 


-1 -i -l A 

1114 


gil5080775 


TT 1 

Homo sapiens 


4__ ■__ 1.' nn . X"TVnP\ C 1 T) C t-v-iT>"MA 

protein Kinase JN YU-orj hikina, 
complete cas. 


1 jOj 


yo 


1115 


gi4159888 


Homo sapiens 


PAC clone RP5-855D21, complete 
sequence. 


2212 


99 


1115 


gil769491 


Homo sapiens 


Human kruppel-related zinc finger 
protem (Z/iNr io4j mKiN/\, partial 
cds. 


1127 


49 


1 1 1 c 

lll5 


gili/52754 


Homo sapiens 


zinc rrnger 1 1 1 1 rnJKJN/\, complete 
cds. 


1 077 

IV/// 


47 


111/ 
1 1 16 


gil 1464740 


Homo sapiens 


gigaxonm itikjn/\, complete cas. 


^1 60 

D 1 Uv 


1 00 

X \J\J 


1116 


gi3789797 


Homo sapiens 


actin binding protein MAYVEN 
ttiRNA complete cds. 


633 


28 


1116 


gi6644176 


Homo sapiens 


kelch-like protein KLHL3a 
(KLHL3a) mRNA, complete cds. 


593 


28 


1117 


gil3278963 


Homo sapiens 


clone IMAGE:3603836, mRNA, 
partial cds. 


1224 


99 


1117 


AAG02054 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6135. 


678 


95 


1117 


gi9757771 


Arabidopsis 
thaliana 


RING zinc finger protein-like 


318 


34 
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s 
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identity 


1118 


gi3869259 


Homo sapiens 


ZNF202 beta (ZNF202) mRNA, 
complete cuts. 


684 


40 


1118 




rlomo sapiens 


(from clone DKFZp727E21 1); 

pd.1 Llcll L/Lio. 


684 


40 1 


I 1 1 o 

I I lo 


A A VOQQ 1 ^7 

AAizyyi / 


xiomo sapiens 


Mnmnn r^T-TTjl rvrntpi'n encoded bv 

transcript cDNAl. 


684 


40 


I 1 1 o 

I I iy 


AAV4olzD_ 

aal 


xiomo sapiens 


TvJnr'lf^ntiHf* ^pmiRTipp pnf!odin0 r clone 
HMWGS46 of Prombitin receptor 

■fc*TT"ll1 1 l/ 

xaiiuxy . 


1297 


89 


1119 


gi2289906 


Mus musculus 


BAP 


1296 


89 


1119 


gi65 63274 


Homo sapiens 


B-cell receptor-associated protein 


1296 


89 


1120 


gil3540403 


Lens culinaris 


histone HI 


104 


28 


1120 


gil3425157 


Caulobacter 
crescentus 


arylesterase-related protein 


99 


28 


1120 


gi7331176 


Homo sapiens 


BM88 antigen mRNA, complete 

COS. 


89 


31 


1121 


AAB58852 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 

JOl). 


511 


100 


1121 


AAY94955 


Homo sapiens 


Human secreted protein clone 
khl3_4 protein sequence SEQ ID 

XNvJ. 1 lO. 


66 


34 


1 i i 1 

1 121 


A A AH 7*7 "3 /111 

AAW /341 1 


Homo sapiens 


xiuman secreieu. pruiciii ciiuuu.cu. uy 
Gene No. 15. 


61 


29 


1122 


giijyjozoJi 


T T •*'V% **"V n «M y\-**% o 

xiomo sapiens 


IMAGE:3028040, mRNA, complete 


1370 


59 


1122 


gi5262557 


Homo sapiens 


mRNA; cDNA DKFZp569D223 1 

ffmm rlonp T)KF7r>^69D223 1 V 

partial cds. 


1370 


59 


1122 


gi5441615 


Canis familiaris 


zinc finger protein 


1365 


56 


1123 


AACjU2 /40 


rlomo sapiens 


xiuman secrete a. protein, onv.y lu 
NO: 6821. 


458 


98 


1123 


AAB65850 


Homo sapiens 


Human TANGO 281 extracellular 
oomain oxiv ji. 


70 


31 


1123 


gill85157 


Odontella 

« • 

sinensis 


SOS ribosomal protein L29 


64 


22 


1124 


gi2576348 


Homo sapiens 


Human Chromosome 16 BAC clone 
CIT987SK-A-735G6, complete 
sequence. 


1520 


100 


1124 


gil3775464 


Caenorhabditis 
elegans 


Contains similarity to r^iam aomain. 
rTUU /4y (^tiviN /\-syni_ 1CJ, 
Score=323.3, E-value=8.9e-94, N=l 




_J o 


1 124 


~;*l/TOO*»^C 

gi2688265 


Borrelia 
burgdorferi 


/v1-ii4-nn^»i1 4-T5~\.T A nmf na^OCO I frit >t 1 

glutamyl-tKJNA syntnetase ^giiA.j 


/ UJ 




H25 


gil2654783 


Homo sapiens 


Similar to loss of heterozygosity, 
11, chromosomal region 2, gene A, 
clone MGC:4904 IMAGE: 346 1486, 
mRNA, complete cds. 


2141 


100 


H25 


AAB82047 


Homo sapiens 


Human mast cell surface antigen. 


2141 


100 


U25 


gi2 190974 


Homo sapiens 


breast cancer suppressor candidate 1 
(bcsc-1) mRNA, complete cds. 


338 


100 


1126 


gi4884114 


; Homo sapiens 


mRNA; cDNA DKFZp586G0518 


4805 


99 
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S 
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(from clone DKFZp586G0518); 
partial cds. 






1126 


AAB95305 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17544. 


3327 


99 


1126 


AAB92704 


Homo sapiens 


Human protem sequence SEQ ID 
NO: 11 106. 


2993 


99 


1127 


AAY25717 


Homo sapiens 


Human secreted protein encoded 
from gene 7. 


66 




1127 


gi7706791 


Arabidopsis 
tnaliana 


3 -hydroxy-3 -methylglutaryl 
coenzyme A reductase isoform 
HMGR1L 






1127 


gil4861050 


Omithorhynchus 
anatinus 


suliotransierase bU.Li 1A 


1 o 




1128 


gi9501358 


Mus museums 


Paxl protein 


lOOO 


on 


1128 


gil2658965 


Mus musculus 


PAX1 


1683 


90 


1128 


gi200224 


Mus musculus 


Pax-1 protem 


lOCD 


OA 


1129 


AAW82571 


Homo sapiens 


Human BBP1 DNA. 


1829 


93 


1129 


AAW82572 


Homo sapiens 


Human BBP1 protem fragment. 


/in 

417 


91 


1129 


gi563377 


Homo sapiens 


H.sapiens (JER58) MUC5AC 
mRNA for mucin (partial). 


199 


25 


1130 


AAY 10834 


Homo sapiens 


Amino acid sequence of a human 
secreted protem. 


1603 


99 


1130 


gil0433027 


Homo sapiens 


cDNA FLJ1 1712 fis, clone 
HEMBA1005185. 


1597 


99 


1130 


AAB93801 


Homo sapiens 


Human protem sequence SEQ ID 
NO:13573. 


1597 


99 


1131 


AAY10834 


Homo sapiens 


Amino acid sequence of a human 
secreted protem. 


1565 


100 


1131 


gil0433027 


Homo sapiens 


cDNA FLJ1 1712 fis, clone 
HEMBA1005185. 


1559 


99 


1131 


AAB93801 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13573. 


i c c r\ 

1559 


99 


1132 


gil3195147 


Mus musculus 


HCH 


9033 


94 


1132 


gil339910 


Homo sapiens 


Human DOCK180 protem mRNA, 
complete cds. 


c t: /in 

5566 


ol 


1132 


AAW03515 


Homo sapiens 


Human DOCK180 protem. 


5563 


/C 1 

61 


1133 


gil2053209 


Homo sapiens 


mRNA; cDNA DKFZp434E2135 
(from clone DKFZp434E2135); 
complete cds. 


549 


100 


1133 


gi695686 


Picea abies 


dall 


59 


42 


1133 


gi4894319 


Bacillus 
anthracis 


pXOl-104 


53 


30 


1134 


gi3411250 


Dictyostelium 
discoideum 


developmental protein 


94 


36 


1134 


gi3 600087 


Dictyostelium 
discoideum 


cytosolic regulator pianissimo 


94 


36 


1135 


gil3543686 


Homo sapiens 


Similar to RiKhN cDJN A 
4931428F02 gene, clone 
MGC: 14797 IMAGE:4064169, 
rnRNA, complete cds. 


1 AQA 
1450 




1135 


AAY07081 


Homo sapiens 


Renal cancer associated antigen 
precursor sequence. 


261 


40 


1135 


gi7020359 


Homo sapiens 


cDNA FLJ20333 fis, clone 
HEP 11252. 


206 


31 


1136 


gil4250138 


Homo sapiens 


Similar to RIKEN cDNA 


1377 


100 
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ID 
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jjescripnon 


c 


identitv 








5730421 El 8 gene, clone 

A/fnr , «1A7QR T\yf A CrT* -4064^91 

XTLXvTN.rV, UOXXipiCLC tUo. 






1136 


gil256896 


Saccharomyces 
cerevisiae 


Ylrl43wp 


552 


49 


1136 


gi995714 


Saccharomyces 

• ■ 

cerevisiae 


L3177 


552 


49 


1137 


gil0798580 


Homo sapiens 


mRNA for 17kD fetal brain protein. 


410 


93 


1137 


A A Y 57916 


Homo sapiens 


xiuman transmemDrane proiem 

JHL 1 ivlr IN -*f U. 


41 0 




1137 


gil498731 


Brassica napus 


pathogenesis-related protein PR1 


111 


35 


1138 


gio934248 


Homo sapiens 


tropomoauim 4 ^ 1 n\\jxj £ + ) mtviN a, 
compiere cas. 


X \JO X 


oo 


1138 


gil2744762 


Homo sapiens 


muscle tropomodulin (TMOD4) 
gene, coiiipieie t/Ut>. 


1081 


83 


1138 


gi6013189 


Homo sapiens 


Sk-tropomodulin (Sk-Tmod) 

JTXX\J/N.rY, UUXIipiCLC LUo. 


1081 


83 


1139 


gil841302 


Sus scrofa 


ribosomal protein 


516 


89 


1 1 jy 


glzUo 1 51 


ivauus 
norvegicus 


riDOSomai proiem Ljoa 


S1 6 




1139 


gil 666702 


Mus musculus 


ribosomal protein 


516 


89 


1140 


gil2653007 


Homo sapiens 


clone MGQ3180 IMAGE:3356277, 
mKJNA, complete cas. 


1220 


100 


1140 


AAG00664 


Homo sapiens 


Human secreted protein, SEQ ID 

aha ^ 
JNU: 4/4Z). 


619 


100 


1140 


AAG00663 


Homo sapiens 


Human secreted protein, SEQ ID 

XTr\. /I T/i /I 

JNU: 4 /44. 


489 


100 


1 "1 A 1 

1 141 


gil4250752 


Homo sapiens 


similar to nect uomain ana kj^u z, 
cione iviurv^. xhooo 

llVX/\VJX3.HOV/L/ / JO, XXXXVLN^V, L-UXX1LJXC/LC 

cds. 


son 


1 on 

xuu 


1 141 


gi4u /youy 


riomo sapiens 


li.JJ/ivv_/Z ^XlJj/Xv^Zy IIlXvi>l/\, OUIXipXCLC 

cds. 


764 


28 


11/11 
1 141 


gl34 /ojoU 


Araoiaopsis 
thaliana 


u v xj -resistance proiem u v xvo 


970 

Z / \J 

* 




1 142 


gil i Jzzyoy 


riomo sapiens 


xiC/O v gene. 


4R^ 




1142 


gil 1322968 


Homo sapiens 


HCG V mRNA. 


485 


90 


1 14z 


gii 1 /643s 


Homo sapiens 


iviriv^ ciass i region. 






1143 


gil2224835 


Homo sapiens 


mRNA; cDNA DKFZp547I082 
(from clone DKFZp547I082). 


622 


100 


1 143 


A A T~> r 1 /'/in 

AAB51649 


Homo sapiens 


riuman secreted, protem sequence 
encoded by gene 30 SEQ ID NO: 89. 


979 
Z /Z 


1 on 

X \J\J 


1143 


AAB75511 


Homo sapiens 


Human secreted protein sequence 
encoaed by gene o oii^ ijj jnu.od. 


272 


100 


1144 


AAB43827 


TT „ ^ 

Homo sapiens 


ld.uman cancer associate a protem 
sequence SEQ ID NO: 1272. 




R7 
o / 


1 144 


<ri1 ^77^484 
gXXO / / JH-O^T 


clCI 1UI XXa U U.1 IXo 

elegans 


similar to T*»la<smr»f"iiTm merozite 

surface antigen precursor 
(SP:P04933) 


359 


39 


1144 


gi6808143 


Homo sapiens 


mRNA; cDNA DKFZp761D05 1 
(from clone DKFZp761D051); 
partial cds. 


162 


36 


1145 


gil4549207 


Homo sapiens 


URAX1 mRNA, complete cds. 


3118 


100 


1145 


AAB27591 


Homo sapiens 


Human secreted protein SEQ ID 
NO: 92. 


2466 


98 
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1145 


gi5901836 


Drosopnila 
melanogaster 


BcDNA.GH09817 


A fll 


AO 


1146 


gil492084 


Molluscum 

contagiosum 

• i_ j 1 
virus subtype 1 


MC141R 


DO 




1146 


AAY72171 


Homo sapiens 


Human RNA metabolism protein 

/n a jT'cti i 1 \ 

(RMEP-1 1). 


58 


36 


1146 


gil2722186 


Pasteurella 
multocida 


SelA 


OO 




H47 


AAB53429 


Homo sapiens 


rluman colon cancer antigen protein 
sequence SEQ ID NO:969. 


1 1 1 R 




1 147 


AAG02922 


Homo sapiens 


xluman secreted protein, oxiv^ iu 
xto- *7nni 






1147 


gi3 123743 


synthetic 
construct 


echistatin 


58 


36 


1 149 


gi52977 


Jvlus museums 


™„ j'f: OT . o /A/TUX 

mouiiier o ^ivioo j 






H49 


gi2266988 


Mus musculus 


M33 polycomb-like protein 


2167 


82 


H49 


gi3860185 


Xenopus laevis 


Polycomb homolog Pc 1 






H50 


gil5213542 


Homo sapiens 


NSD1 (NSD1) mRNA, complete 
cds. 


1618 


100 


1 150 


gi3329465 


Mus musculus 


NSDl protein 




OA 


H50 


gi3249715 


Homo sapiens 


MMSET type I (WHSCl) mRNA, 
complete cds. 


86 


31 


H5l 


gil2005728 


Homo sapiens 


GL012 mRNA, complete cds. 


922 


100 


1 15 1 


gi258834 


Mus sp. 


alpha 1 (XII) collagen {triple-helical 
domain COL2} 


1 

Ol 


3 / 


1151 


gi4583418 


Homo sapiens 


Kruppel-like zinc finger 
transcription factor gene, complete 
cds. 


77 


46 


1152 


AAB47134 


Homo sapiens 


CDIFF-15, Incyte ID No. 
3478571CD1. 


1286 


59 


1152 


AAY27621 


Homo sapiens 


Human secreted protein encoded by 
gene No. 55. 


661 


70 


1152 


gi7022185 


Homo sapiens 


cDNA FLJ10260 fis, clone 
HEMBB 1000973, moderately 
similar to Mus musculus schlafen3 
rnKJNA. 


379 


44 


1153 


gil2654519 


TT _ " _ 

Homo sapiens 


clone lMAoEiiO 04989, mKINA, 
partial cds. 


l ouy 


QQ 


1153 


gi 10440 173 


Homo sapiens 


cDNA: FLJ23471 fis, clone 
Hbll 1969. 


470 


29 


1153 


gi 104404 16 

* 


Homo sapiens 


mRNA for FLJ00043 protein, partial 
cds. 


257 


45 


1154 


AAG03284 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7365. 


364 


100 


1154 


gi2853301 


Homo sapiens 


mucin (MUC3) mRNA, partial cds. 


111 


21 




gizDooU / 1 


Aenopus laevis 


Kinesm-reiaLcu. piuicni 




25 


1155 


AAB58405 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 743. 


814 


99 


1155 


gi9105937 


Xylella 

fastidiosa 9a5c 


acetylglutamate kinase 


505 


36 


1155 


gi5155 


Saccharomyces 
cerevisiae 


acetylglutamate kinase 


312 


28 


1156 


gil3436152 


Homo sapiens 


reticulocalbin 2, EF-hand calcium 
binding domain, clone MGC:1650 


1678 


100 
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S 
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identity 








IMAGE:3505241, mRNA, complete 
cds. 






1156 


gi469885 


Homo sapiens 


H.sapiens ERC-55 mRNA. 


1678 


100 


1156 


AAW21949 


TT * 

Homo sapiens 


E6-bmdmg protem Eo~BrbD /. 


lO/o 




1157 


gil293893 


Mus musculus 


leucine zipper protein 1 


4333 


80 


1157 


gi 14042584 


Homo sapiens 


cDNA FLJ 14799 fis, clone 
NT2RP4001351, weakly similar to 
Human ovarian cancer 
aownregulatea myosin neavy cnam 
homolog (Docl) mRNA. 


604 


37 


1157 


AAB93244 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12253. 


604 


37 


1158 


gi9964287 


Homo sapiens 


hypertension-related calcium- 
regulated gene mRNA, complete 
cds. 


1 

1UJO 

1 


y4 


1158 


gil 0434820 


Homo sapiens 


cDNA FLJ13008 fis, clone 
NT2RP3000456. 


1036 


94 


1158 


gil2803673 


Homo sapiens 


HT002 protein; hypertension- 
related calcium-regulated gene, 
clone MGC:3418 IMAGE: 3 606279, 
mKJNA, complete cos. 


1036 


94 


1159 


gi3387790 


Homo sapiens 


protem tyrosine pnospnatase JriK-i 
mRNA, complete cds. 




Q7 


1159 


gil2653157 


Homo sapiens 


dual specificity phosphatase 1 1 
(KJNA/KNr complex 1 -interacting;, 
clone MGC:8576 IMAGE:2960939, 
mRNA, complete cds. 


mi 


97 


1159 


AAG01376 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5457. 


04/ 


1 nn 


1160 


♦ a /~\ -i /Tin 

gi5901688 


K A' 1 

Mus musculus 


GRIN 1 


Zl / 


zz 


1160 


gi557822 


Saccharomyces 

» » 

cerevisiae 


mal5, stal, len: 1367, CAI: 0.3, 

A ■\/I"\./"LT "\/"C A OTP A 

AMYrt_Y.bAol JrUo04U 

m T Tr^O A A/TVT A QT7 Q 1 /'PP 1 O 1 1^ 
OJLU LAJAlVi Y J-AoJC/ ol ^xiO o.Z. 1 . d) 


206 


20 


1160 


gil304387 


Saccharomyces 

* ■ 

cerevisiae var. 
diastaticus 


glucoamylase 


206 


20 


1161 


gil28036Q3 


Homo sapiens 


clone ivnjrL>.4ioi iiviao±}. jouoyy^t, 
mRNA, complete cds. 




i no 


1161 


AAB21042 


Homo sapiens 


Human nucleic acid-binding protein, 

XTn A DD A A 

INU/Vr5ir-40. 


397 


51 


1 1 zT 1 

1161 


gi4097497 




Mus musculus 


zinc tmger protem V4 


D 1 O 




1162 


gi!3661556 


Homo sapiens 


extracellular glycoprotein EMILIN- 
z precursor, mKJN A, complete cas. 


1410 


98 


1162 


gil45 17602 


Homo sapiens 


mRNA for FOAP-10 protein, partial 
cds. 


1092 


99 


1162 


gil4043093 


TT * 

Homo sapiens 


—1 — X 4" / — 1 . 1 f 1 AO 

clone MGC: 15203 
cds. 




4D 


1163 


gi3 11946 


Homo sapiens 


H.sapiens pi 8 mRNA. 


85 


31 


1163 


gi4927776 


Homo sapiens 


alternative HHLA3 protein 
(HHLA3) mRNA, complete cds. 


63 


36 


1163 


gil3883114 


Mycobacterium 

tuberculosis 

CDC1551 


helicase, UvrD/Rep family 


83 


43 


1164 


gil2053149 


Homo sapiens 


mRNA; cDNA DKPZp434G2226 


4649 


100 
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Species 


JJesci iption 


score 


r citcin 

identity 








(trom Clone lJKrZ,p4j4<jrZZZ0J, 

complete cas. 






1 1 tZA 

1 lo4 


gizzjyz4z 


ocnizosaccnarom 
yces pomoc 


Kinesin-iiKe protein 






1164 


gil 881662 


Drosophila 
meianogasier 


kinesin like protein 67a 


962 


43 


1165 


gil5186738 


Mus musculus 


Tac2~N 


2229 


87 


1 1 /CC 

1 loD 


A A PAO G A1 


riomo sapiens 


riuman secreted protein, orsv^ iu 
NO: 7623. 


tuo 


07 


1 165 


giol367oo 


Mus musculus 


synaptotagmm vll 


ZU / 




1166 


gi2760351 


Girardia tigrina 


myosin heavy chain 


197 


23 


1 166 


gi3986l94 


Dugesia japomca 


myosin heavy chain 


1 O 1 

iy i 


zz 


1166 


gill067 


Schistosoma 
mansoni 


myosin II heavy chain 


182 


21 


1167 


A A Y5 6021 


T T * 

Homo sapiens 


Human CD40 receptor interacting 
protein 4C4. 


Z1Z1 


99 


1167 


gi3638955 


Homo sapiens 


T*» A /"""t 1 . TP* T» A nciTTn -C. n„o f 

PAC clone RP4-751H13 from 7q35- 
qter, complete sequence. 


zyo 


ZZ 


1167 


gi4063770 


Ipomoea 
purpurea 


transposase 


i on 

130 


15 


116S 


gi7022811 


Homo sapiens 


TWT A T'T T1 (~\ /I A C\ £1 „ „ 1 ^ 

cDNA FLJ 10649 lis, clone 
NT2RP2005835, weakly similar to 
bliPl rK01.bJ.JN. 


1 lo 


Zo 


H68 


AAB9303 1 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11803. 


116 


26 


1 168 


gi7800642 


Streptococcus 
pneumoniae 


FspA 


1 O 1 

1Z1 




1169 


gil5081398 


Homo sapiens 


kruppel-like zinc finger protein 
(^JNJtjUU) mKJNA, complete cos. 


218 


53 


H69 


gi340448 


Homo sapiens 


Human zinc-finger protein 8 (ZFP8) 
mKJNA, 3 end. 


211 


64 


H69 


gil4602511 


Homo sapiens 


Similar to zinc finger protein 274, 
clone MCjC:1 1d9 lMA(jli:zyoo/o4, 
rnRNA, complete cds. 


201 


52 


1 170 


gi22333 


Zea mays 


nydroxyproline-rich glycoprotem 


c\r\ 

90 


1A 

Z4 


1170 


gi4007865 


Zea mays 


Hydroxyproline-rich Glycoprotein 

/T_TT") /~^T» \ 

(HKCrr) 


90 


26 


1170 


gi257041 


Zea mays 


hydroxyproline-rich glycoprotein, 
HRGP 


90 


26 


1 171 


gi 10440267 


T T * 

Homo sapiens 


cDNA: FLJ23544 fis, clone 
LNG08336. 


695 


1 f\r\ 

100 


-1 ■* >~~t -4 

1171 


AAG02534 


Homo sapiens 


Human secreted protein, SEQ ID 
JNO: 6615. 


61 


40 


1172 


gil 343 6440 


Homo sapiens 


clone MGC:4400 IMAGE: 2 905 97 6, 
rnRNA, complete cds. 


2543 


99 


1172 


gil 0434258 


Homo sapiens 


cDNA FLJ12644 fis, clone 

in izKivi4UUiy /y, weaKiy similar to 

ZINC FINGER PROTEIN 84. 


909 


57 


1172 


AAB94205 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14550. 


909 


57 


1173 


gi6855513 


Gallus gallus 


syndesmos 


417 


64 


1173 


gil3623247 


Homo sapiens 


Similar to RIKEN cDNA 
1 1 10001K21 gene, clone 
MGQ11275 IMAGE:39443 55, 
rnRNA, complete cds. 


403 


62 
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c 
o 

score 


X CI vvll 1. 

identity 


1173 


gil 5 157405 


Agrobacterium 
tumefaciens 


AGR_C_4093p 


86 


36 


1174 


gil3310782 


Mus musculus 


myoneurin 


237 


33 


1174 


gi498736 


Homo sapiens 


H.sapiens HZF9 mRNA for zinc 
finger protein. 


227 


51 


1174 


gil0086309 


Mus musculus 


Kruppel-type zmc finger protein 
KROX-25 


... 

ZL 1 


cry 


1175 


AAB58899 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence b-bv^ 1JJ 
607. 


OHO 


QO 


1175 


gi9967699 


Schizosaccharom 
yces pombe 


protein with weak similarity to 
Azospirillum brasilense nifR3 
proteni; yeast YMLUoUW nomolog, 
Ur\rUU34 aomain 


732 

•■ 


50 


1175 


gil5146316 


Arabidopsis 
thaliana 


AT5g67220/K2 1H1_1 8 


687 


51 


1176 


gill558109 


Homo sapiens 


mRNA for PRAM-1 protein. 


2373 


87 


1176 


gil3938357 


Homo sapiens 


Similar to R1KEN cDJN A 
0610030H11 gene, clone 
MGC:15716 IMAGE: 33 53 224, 
niKJNA, complete cas. 




o / 


1176 


gi59 17666 


Zea mays 


extensin-like protein 


466 


28 


1177 


gi 10432603 


Homo sapiens 


cDNA rU 1 1360 lis, clone 
HEMBA 1000231. 






1177 


AAB93711 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13305. 


3044 


100 


1177 


gil3872809 


Homo sapiens 


mRNA for artemis. 


3001 


99 


1178 


gil2382773 


Homo sapiens 


caspase recruitment domain protein 
1 1 mRNA, complete cds. 


1999 


99 


1178 


gil3488607 


Homo sapiens 


caspase recruitment domain protein 
10 mRNA, complete cds. 


309 


33 


1178 


gil4192725 


Homo sapiens 


CARD-contammg MAGUK 3 
protein (CARMA3) mRNA, 
complete cds. 


1 AO 




1179 


gil2652571 


Homo sapiens 


integral type 1 protein, clone 
MGC:1302 IMAGE: 3 507308, 
mRNA, complete cds. 


oy i 


i no 


1179 


gil4790009 


Homo sapiens 


Similar to integral type I protein, 
clone MGC:8938 IMAGE: 3 876465, 
mRNA, complete cds. 


691 


100 


1179 


gi4583677 


Homo sapiens 


mRNA for integral type I protem 
p24B (p26). 


/CO 1 

oy i 


1 no 
1UU 


1180 


AAB47327 


Homo sapiens 


FCTR4. 


3068 


99 


1180 


gi3041860 


Homo sapiens 


PAC clone RP4-545C24 from 7q21- 
q22, complete sequence. 


12/ J 




1180 


gil4603161 


Homo sapiens 


Rho guamne nucleotide exchange 

IMAGE:3636857, mRNA, complete 
cds. 


1 T7C 




1181 


AAW79092 


Homo sapiens 


Human secreted protein dn740 3 . 


724 


70 


1181 


AAY73380 


Homo sapiens 


HTRM clone 1821233 protein 
sequence. 


724 


70 


1181 


gil3543342 


Homo sapiens 


clone IMAGE:2821841, mRNA, 
partial cds. 


585 


36 


1182 


gil3097648 


Homo sapiens 


clone IMAGE:3610995, mRNA, 


782 


100 
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ID 
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Species 


uescriptioii 


c 

score 


identity 








partial cds. 






1182 


gi70 19947 


Homo sapiens 


cDNA FLJ20080 fis, clone 

LaJ-LUd 184. 


112 


26 


1182 


gil0433190 


Homo sapiens 


cDNA FLJ1 1837 fis, clone 
HEMBA1006612. 


105 


26 


1183 


gil3097648 


Homo sapiens 


clone IMA(jrb:3oluy9D ? mKJNA, 
partial cds. 


/ oZ 


1 nn 


1183 


gi8977890 


Homo sapiens 


TP»-vT A £.11 1 xt-. * „ ^ rt "T\\T A 

rriRNA full length insert cDJNA 
clone EUROIMAGE 364773. 


1 1 o 

1 lo 




1183 


gi5817164 


T T * 

Homo sapiens 


mRNA; cDNA DKrZp4J4H2U4 
(from clone DKFZp434H204). 


1 1 Q 

1 lo 




1184 


gil0434709 


T T • 

Homo sapiens 


cDNA FLJ12934 lis, clone 
NT2RP2004978, weakly similar to 

AC 1 IJN-i^livll rKU 1 HUN Alvl o . 






1184 


AAB94379 


Homo sapiens 


Human protein sequence SEQ ID 
IN (J: 14yz8. 


3306 


99 


1184 


gil0439016 


Homo sapiens 


cDNA: FLJ22579 fis, clone 
HSI02562. 


1978 


100 


1185 


gil 160355 


Caenorhabditis 
elegans 


UNC-89 


307 


27 


1185 


gi8250181 


Drosophila 
melanogaster 


D-Titin 


319 


22 


1186 


gil3276231 


Homo sapiens 


mRNA for FYVE and coiled-coil 
domain contaimng 1 (J^YCUl genej. 


7470 


100 


1186 


gil3938195 


Homo sapiens 


FYVE and coiled-coil domain 
contammg 1, clone JVKjC:14yy8 
IMAGE:3050704, mRNA, complete 
cds. 


1271 


98 


1186 


gil0435317 


Homo sapiens 


cDNA FLJ13335 fis, clone 
OVARC1 00 1861. 


1263 


99 


1187 


gil0435317 


Homo sapiens 


cDNA FLJ13335 fis, clone 
OVARC1001861. 


1290 


100 


1187 


AAB94590 


T T * 

Homo sapiens 


Human protein sequence SEQ ID 
NO:15398. 




1UU 


1187 


gil3938195 


TT * ^ _ 

Homo sapiens 


r Y Vh and coilea-cou domain 
containing i, clone ivior^.i^yyo 
IMAGE:3050704, mRNA, complete 
cus. 


1 9£S 
1ZOJ 


yo 


1 1 o o 

1188 


gllzJZ) jz / 


riomo sapiens 


numan myogioom gciic ^cauh i j 
(and joined CDS). 


809 


100 


1 1 o o 

1 188 


gliooo /z 


riomo sapiens 


xiuman myogioom gene, cxon j. 


806 


99 


1 1 o o 

1 188 


„4 1 C /I ^7 


c>us scrota 


myogiooin 


758 


93 


1189 


gil4587078 


Danio rerio 


Unci 19c 


857 


69 


1189 


gil4587076 


Danio rerio 


Unci 19b 


/ oz 


7^ 


1189 


gill61378 


Rattus 

■ 

norvegicus 


retinal protein 


700 


71 


1 1 on 

i iyv 


gi/Tv/H- / ji 


AAiiq mncrnlii? 

lVJ.Uo illU.ol/UJLU.3 


af1pnv1r><3iippinate svnthetase 


2353 


96 


1190 


gi415849 


Homo sapiens 


Human adenylosuccinate synthetase 
mRNA. 


1864 


77 


1190 


gi404057 


Mus musculus 


adenylosuccinate synthetase 


1842 


76 


1191 


gil5081791 


Arabidopsis 
thaliana 


Atlg09280/T12M4_1 


527 


36 


1191 


gil2541446 


Corynebacterium 
glutamicum 


RXC00552 


447 


34 


1191 


gil3702647 


Staphylococcus 


conserved hypotehtical protein 


439 


34 
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S 

score 


Percent 
identity 






aureus subsp, 

ot it* oil o fvT a 1 ^ 

aUlcUS 1NO 1 D 








1192 


gi2970646 


Mus musculus 


Xin 


3622 


53 


1 1 no 


„ 1 , On r 7 f\£.A A 

gl/y /U044 


oanus ganus 


yvin 


u i j 


47 


1192 


gil5029903 


Mus musculus 


Similar to proline-rich protein BstNI 
subfamily 2 


202 


27 


1193 


gil2804921 


Homo sapiens 


oimilar to membrane pro tern or 
cnoiinergic synaptic vesicies, cione 
MGC:2671 IMAGE:3547155, 
rnrvi>/\, cornpieie cus. 


1 <<a 


1 (\(\ 
1UU 


1 1 on 


rr-; 1 /io <n^4^ 

gll4Z3U340 


xiomo sapiens 


ClOne IVIO^.JU ID llVJLrVVJJIy.J IDZJHj, 
ml? TvT A rr* Tnt^l F^i'F* r*rlc 




1 on 


1193 


gil 698401 


Homo sapiens 


Human BRCA1, Rho7 and vatl 

gCIlCb, L/OllipiCLC CLIlS, cLLLkX xjJLO ~> gCIICj 

partial cds. 


1553 


99 


1 1 Q/1 
1 iy4 


gl!4DyoLO / 


nomo sapiens 






99 

yy 


1194 


gil 42499 11 


Homo sapiens 


clone IMAGE:3506202, mRNA, 

palXlal CCIS, 


3528 


100 


1194 


gil4597918 


Homo sapiens 


human CLASP-3 


3156 


72 


I i o< 

I I y_> 


gil i4yo4 1 / 


xiomo sapiens 


T)T> /^l 1 0Q0 


1AA 

Z^fH- 


1 nn 

1UU 


1 1 

1 lyo 


gllD /4Jo 1 


riaemopnuus 
lniiuenzae sssx 


lie - 1 operon pro xeni ^ncv^. ) 




^7 


1195 


gil3359399 


Oserya 
coulteriana 


maturase K 


41 


53 


1 1 Q£ 

1 1 y O 


/\/\_D DU40 / 


nomo sapiens 


xxuman ceil cycie dnu proiiierauuii 
nrntein fPYPR-1 S WD TD NO'1 S 


079 
y i 


1 on 


1196 


AAB75130 


Homo sapiens 


Human SCAP 22 protein sequence 


968 


99 


1196 


gil2805505 


Mus musculus 


Similar to CHMP 1 .5 protein 


958 


97 


1197 


gil 1559490 


Homo sapiens 


VDUP1 gene, complete cds. 


1631 


99 


1197 


gi77 17464 


Homo sapiens 


brain-expressed HHCPA78 homolog 
vj_/UJri ^vjrenej m±sJi>/\, complete 
cds. 


1631 


99 


1 1 0^7 

i iy / 


rri 1 1 1 Q 1 /CO /I 

gil 1 loloZ4 


JVlus museums 


tmoieaoxm interacting ractor 


1 JOU 




1198 


gil4906268 


Homo sapiens 


Pur-beta (PURB) mRNA, complete 
cds. 


1638 


100 


1198 


gi2460119 


Mus musculus 


vascular actin single-stranded DNA- 
Dinamg lactor z p44 component, 
purine-rich single-stranded DNA- 
binding protein beta; PurB beta 


1569 


94 


1 1 A O 

1 19o 


giiyo /so 


Homo sapiens 


H. sapiens Pur (pur-alpha) mRNA, 
complete cds. 


y /y 


/ 1 


1 199 


gilU4J8372 


Homo sapiens 


_t\\t A . T?T TOO 1 f\A -C« 

cDJNA: .bLJzzKM lis, clone 
HEP 17633. 


loo / 


on 

yy 


1 1 no 

i iyy 


glD44iy52 


Homo sapiens 


peroxisomal membrane protein PMP 
Z4 mKJNA, complete cas. 


25 / 


OQ 

Zo 


1 1 99 




bclLJICllo 


K.J_y lI HILL lllollx 1 JLltlllUl clll^ ^Jl. W 1^/111, 

clone MGC:1213 IMAGE:3 533572, 
mRNA, complete cds. 


87 


28 


1200 


gil 03 9447 


Saccharomyces 
cerevisiae 


Lpblp 


243 


43 


1200 


gil 17 1427 


Saccharomyces 
cerevisiae 


Ypl030wp 


121 


46 


1200 


gi6318215 


Mus musculus 


E-selectin ligand 1 (ESL-1) 


97 


31 


1201 


gil3529542 


Mus musculus 


RIKEN cDNA 4731402F03 gene 


609 


45 
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Tin T"n 

All I lil 




Tl p ri n ti n n 


s 

score 


Percent 
identity 


1201 


AAW50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


74 


48 


1ZU1 


gll 1 DDy^fUO 


V^aiUS iamillarib 


LltjpdilllllC 1CL<C|JIIJJ. J_/*t 


86 


57 


1202 


gil3561413 


Homo sapiens 


F-BOX domain protein mRNA, 
complete cds. 


2076 


99 


1ZUZ 


glOJOl os> 1 


xiomo sapiens 


ITlllbCie CllbCd.oC-ICld.lCU. piULClil 

rnRNA, complete cds. 


kJ O 


40 


1ZUZ 


gii to i iyj 


ivius mus cuius 




96 


38 


1203 


gi2 19980 


Homo sapiens 


Human mRNA for paraneoplastic 

OClCUClldl LlCgCllCldLlUll d-ooVJOlcllCU. 

antigen, complete cds. 


2243 


100 


1 oni 
IZlo 


glZ4 1 / / / 


numan, mivLN/\, 
2192 nt], [Homo 

cam pnc 
odJJICllo 


ZOu IvlN .rV, au.ltJdllLlgCll !CuVJglllZ<CU. 

by an anti-neuronal cell antibody 


2243 


100 

i v/vy 


1203 


gil80187 


Homo sapiens 


Human major Yo paraneoplastic 
antigen (CDR2) mRNA, 3' end. 


2206 


96 


1ZUH- 


gll 15 / uuuo 


JxdllUb 

• 

JLJLUl V 1^ Ho 


■nnfl^siT* mnfriv ■h*aiiQr'TiT>tion TftpfriT 

illiL/lCdl llld-LLlA. LI Clllo^l 1JJ 11U11 laL LVJ1 


737 


81 


1204 


gi6729087 


Rattus 
norvegicus 


Cas-associated zinc finger protein 


719 


80 


1204 


gil 1870000 


Rattus 

• 

norvegicus 


nuclear matrix transcription factor 


719 


80 


1205 


gil4043225 


Homo sapiens 


clone MGC: 15678 
IMAGE:3350074, mRNA, complete 

CQS. 


470 


100 


1205 


gil3537206 


Homo sapiens 


hMBLR mRNA, complete cds. 


467 


85 


1 1AC 

1205 


gl!4U4Zooj 


xiomo sapiens 


cjl/IN/\ ri^j 1^4-y /y us, cione 

Y79AA1000037, moderately similar 
tr» tyma T^TTsjnrNjrj ppotftnf rtvtt- 

l(J LylN/\-I3±IN J_y 11N vJ x 1 IJ/IIN JDlVll- 
1. 


*+D / 


S5 


1206 


gil2052744 


Homo sapiens 


mRNA; cDNA DKFZp564K0322 

l 11 Ulll LiUHC J_-/i>vJ. Z-i^J^.' UtJVUJZ,Z, y , 

complete cds. 


2777 


99 


1 ZUO 


gi^yj 1V4 


r^iasmouium 
falciparum 


Q O T^l*1 (Tfin 

O-dllll^Cll 


1 7ft 

I/O 


27 


1 ZUO 


rri 1 AA1 OH 

gnouiyu 


r ias mo aium 


CllCUIllopUIUZ(UlLC dllllgCIl 


1 SO 


35 


1207 


gil2053315 


Homo sapiens 


mRNA; cDNA DKFZp586K0717 
Tfrom clone DKF7r»S86K0717V 
complete cds. 


2748 


97 


1 907 


A AW74S09 


T-Tnmr> canipn c 
XXUlllU adJJICllo 


TTmnan sppfpfpH nrntpin PTK^nHfifl bv 

11U111U11 uvulv LwLl UlUlwlll WlvUUvU J 

eene 73 clone HSOEL25 


2748 


97 


1207 


AAB56895 


Homo sapiens 


Human prostate cancer antigen 
r>rotein seauence SEO ID NO 1473 


2748 


97 


1208 


gil53781 


Streptococcus sp. 


protein H precursor 


154 


26 


1 ZUo 


gl!Z4 / / 3D 


otrepio coccus 
tjvoffenes 


pioiciii n 






1208 


gi557785 


Saccharomyces 
cerevisiae 


tropomyosin 


132 


28 


1209 


gil4336699 


Homo sapiens 


16pl3.3 sequence section 2 of 8. 


101 


30 


1209 


gil0179425 


Homo sapiens 


Rb-associated protein mRNA, 
complete cds. 


73 


25 


1209 


AAW88833 


Homo sapiens 


Polypeptide fragment encoded by 
gene 70. 


66 


40 


1210 


gil 043 5 667 


Homo sapiens 


cDNA FLJ13611 fis, clone 


1817 


100 
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X CI vCHl 

identity 














1210 


AAB95495 


Homo sapiens 


Human protein sequence SEQ ID 

MO* 1 RfY*7 
1NU. 1 oUj / . 


1817 


100 


1210 


gil5080536 


Homo sapiens 


Similar to RIKEN cDNA 
2410002022 gene, clone 
MGC:21036 IMAGE:4508 655, 
mKJNA, complete cas. 


1026 


100 


121 1 


■ i A/in C/Z/ZH 

gllU4i5oo/ 


Homo sapiens 


~TA"kT A T7T -Pic /MrvnA 

CJJJNA JrJ-J i joi i us, cione 
PLACE1010802. 


1 779 




121 1 


AAB954y5 


Homo sapiens 


xiuman protein sequence ^.ea^ iu 
NO: 18037. 


1 779 
1 / / Z 


OR 


101 1 

121 1 


gil50o053o 


Homo sapiens 


oimiiar to KJ.iv±iiN clmna 
2410002022 gene, clone 

1vio^.ziu.jO iivx/\\jrri < £ + jvjod j j, 

mRNA, complete cds. 


QR1 

y o i 


07 


1212 


gil2 16477 


Mus musculus 


zinc finger protein 60 


138 


28 


1212 


gio / 60445 


Homo sapiens 


omaa- ana un- interacting zinc 
finger protein mRNA, partial cds. 


1 AO 


99 


1212 


gi4753764 


Homo sapiens 


OZF gene exon 5 (and complete 


128 


24 


1213 


gi268 


Bos taurus 


cytochrome c oxidase subunit VIb 
(AA 1-86) 


445 


90 


1213 


gil2654383 


Homo sapiens 


cytochrome c oxidase subunit VIb, 

Clone JVlvjV^.3UOl IlvlAOxi.J.5'H+ /Ul, 

rnjx±N/\, complete cas. 


423 


83 


1 Ol o 

121.5 


rri 1 OQmT) 1 


riomo sapiens 


cyxocnrome c oxiaase suduiiil v id, 
clone MGC:1153 IMAGE:3347455, 
mRNA, complete cds. 


49^ 


OJ 


ill/I 

1214 


glliz /yiJD 


rlomo sapiens 


piecKStrin nomoiogy, oec/ ana 
coiled/coil domains 2 (cytohesin-2), 
clone MGC:642 IMAGE:3538580, 
inrviN/\, complete cub. 


1 7^7 
1 / O / 


1 no 


1214 


gil 575766 


Homo sapiens 


cytohesin-2 mRNA, complete cds. 


1767 


100 


1214 , 


gi3660538 


Mus musculus 


cytohesin 2 


1756 


99 


1 1 1 c 

1215 


gil2052736 


Homo sapiens 


mRNA; cDJNA DKrZpjo4Ml izz 
(from clone DKFZp564Hl 122); 
complete cds. 


1 /Uz 


1UU 


1 1 1 c 

1215 


* 1 CIO ^7 £~ 1 

gil5126751 


Homo sapiens 


clone JVLvjrC:l /ol4 
IMACjli: Jo jZj jo, mKJNA, complete 
cds. 


yu i 


yo 


1215 


gil3937992 


Homo sapiens 


clone MGC: 14768 
lMA(jrii:4zy lyUz, mKJNA, complete 
cds. 


1123 


100 


1 O 1 /Z 

121o 


AAJ363J1 J 


Homo sapiens 


rluman breast cancer associatea 
antigen protein sequence SEQ ID 
NO:675. 


ID** 


% 1 

O 1 


1216 


cn'1 4^6799 


Tlntrin compile 
XJAJlilU aAJJICLlo 


1 fW\ 1 ^ ^ Q^rmPTiff* Qpption ^ of* R 


149 


32 


1216 


AAB63312 


Homo sapiens 


Human breast cancer associated 
antigen protein sequence SEQ ID 
NO: 674. 


132 


34 


1217 


gi8096269 


Nicotiana 
tabacum 


KED 


162 


19 


1217 


gil2803875 


Homo sapiens 


Similar to splicing factor, 
arginine/serine-rich 4, clone 
MGC:3920 IMAGE:3619538, 


160 


24 
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mRNA, complete cds. 






121/ 


gllU_OU34 


uaenoinaDaitis 

clcgallS 


cooed ior uy v^,. eiegans uj-^injtv 
cDNA ykl27b8.3 


I J7 


26 


1 0 1 2 




XIOIIIU bapiCXlb 


HnmaTi r\T , r»«ii'5itf : * r*finrpr antio'ft'n 

nrotein seauence SEO ID NO* 1 184 


792 


96 


1218 


AAG01959 


Homo sapiens 


Human secreted protein, SEQ ID 
NO* 6040 


567 


99 


1218 


gi9802603 


Arabidopsis 

1*ri film Tin 

Lllu.llcllJ.cl 


T2E6.7 


70 


29 


1219 


gil373252 


Mus musculus 


STONE14 


1270 


82 


1219 


gil938245 


Rattus 

« 

IlUlVCglOUo 


PKC-zeta-interacting protein (ZIP) 


1248 


82 


1219 


gil3543730 


Mus musculus 


Similar to sequestosome 1 


1271 


83 


IzzU 


ot\ 1 C\A A (\^OC\ 

gllU44UDoU 


riomo sapiens 


rrixviN/\ ior jti^juui i j piuLcm, pctiuai 
cds. 


4008 


99 


1 T)A 

1ZZU 


gioy i / / du 


jtiomo sapiens 


TT -r«-r\tp>i-n T -Minn TT T~NI A ^ 

r-Dt)A proicm rwiiiiia ^xvii^iiNrv^ 
mRNA, complete cds. 


2467 


99 


1 


rr-ICO 1 QKOO 

gioy loj/z 


riomo sapiens 


jr^^^vZ iiir\J-N r\ ior protein 
containing CXXC domain 2, partial 
cds. 


o / w 


7? 


1 OO 1 

Izzl 


glO lzOZ>40 


riomo sapiens 


Clone iivi/\v_iri.o oDojyv, irirvi>j/\, 
partial cds. 


1 1 17 

X X D I 




1 OO 1 

lzzl 


gl 13430290 


riomo sapiens 


cione iivi/vorri. jo iu / iz, iill\.in/-v, 
partial cds. 


oZo 


QO 


lzzl 


gll4UJyo4j 


riomo sapiens 


testes ueveiopment-reiaieii in i u- 
SP18 mRNA, complete cds. 


I JU 


10 


1222 


gi6599138 


Homo sapiens 


mRNA; cDNA DKFZp434I036 
^irom cione j^ivjrz.pH-o'+iwjuy, paitidi 
cds. 


102 


24 


1 000 
LZZZ 


gl4jZZ^U4 


iiomo sapiens 


LI alio la. LIU 11 llllLlaLUJIi JLctOLUl IX Z, 

mRNA, complete cds. 


102 


24 


1 000 
1ZZZ 


glDUUZO^f J 


riomo sapiens 


lllXvIN xtl. 1UI UallblclUAJXl xlXxLXa.llL/xx 

factor 2 (IF2 gene). 


1 02 


24 


1223 


gil3276645 


Homo sapiens 


mRNA; cDNA DKFZp761F0123 
^irom uione r/ivr z^p / uirui zj ^, 

complete cds. 


1999 


100 


1 000. 
IZZd 


gl!4 /yuuo4 


xiomo sapiens 


cione liVLfvvjxj/.oo / oouu, iixx\j.n^-\., 
partial cds. 


1 595 


100 

X \J\J 


1ZZ3 


gl3 1 10Zl4 


riomo sapiens 


mixiN/v ior olio Dinaing protein, 
complete cds. 


541 


45 


~\ OO A 

LZZh 


gllzlDilDj 


riomo sapiens 


nilvIN/V, ClJlN/V UAJ7Z,p4j411 1 lO 

(from clone DKFZp434Il 1 16); 

t~* ntnvx 1 cx\~cx A/in 

COIXipiCLC CQSi 


4714 


QO 

yy 


1224 


gil0438819 


Homo sapiens 


cDNA: FLJ22429 fis, clone 
ni\.v^uyuo4. 


1450 


97 


1274 


oil 041 R61 6 

g,l 1 UtJOU 1 u 


XXVJlllw ouUlt'llo 


cDNA- FLJ22291 fis clone 
HRC04410. 


1956 


99 


1225 


AAB21042 


Homo sapiens 


Human nucleic acid-binding protein, 
NuABP-46. 


2498 


99 


1225 


gi8099348 


Homo sapiens 


zinc finger protein (ZFP) mRNA, 
complete cds. 


960 


55 


1225 


gi9963806 


Homo sapiens 


zinc finger protein 2TNF287 
(ZNF287) mRNA, complete cds. 


924 


56 


1226 


gi22 17970 


Homo sapiens 


mRNA for Rab9 effector p40, 


1116 


81 
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complete cds. 






1226 


gil2653463 


Homo sapiens 


Rab9 effector p40, clone 
MGP-R4S9 TMAGF*2821471 

IVIvJVw'iOt J7 JLlVX.Ti.vJ i-j . j-j X ^ 1 ij 

rriRNA, complete cds. 


1113 


81 


1ZZO 


AADjojjO 


xiomo sapiens 


sequence SEQ ID 674. 


520 


62 


1ZZ / 


gizz i /y /u 


nomo sapiens 


complete cds. 


1115 

_L J. X 


71 


1ZZ / 


gllZODJHOJ 


nomo sapiens 


IvoD-/ C1ICOLUI ptv, C1U11C 

MGC:8459 IMAGE:2821471 5 

mPTSJA rTimnlftp rnQ 


1112 


71 

1 X 


1227 


AAB58336 


Homo sapiens 


Lung cancer associated polypeptide 
<?ermenc*e SFO TD 674 


812 


64 


1228 


gi3015538 


Homo sapiens 


nuclear dual-specificity phosphatase 
fSBFl") mRNA Dartial cds 


2977 


62 


1228 


gi3851594 


Drosophila 
melanogaster 


SET domain binding factor 


1396 


43 


IZZo 


gllZZZH-y /O 


T— T r\ xy\ c\ ocmipno 
X1U111U adJJldlo 


mRNA- rDNA DKF7r>667L246 
(from clone DKFZo 667L246} 

1 11 VJXXX vivllv X— ^ XVX V/ V/ / XJ^w i v / « 


1198 

X X w 


100 


1 99Q 




T-fnmrt c ^ tri f*x\ q 
JlJ.VJ111W odpiciio 


lonirvnel-lilre 7inc finder nrotein 

xVx U-UL/vl 11A.W ZJXXXw XXJJLc^wl Lvxxx 

(ZNF300^ mRNA comolete cds. 


796 


54 


1229 


gil3752754 


Homo sapiens 


zinc finger 1111 mRNA, complete 
cds 


761 


54 


1229 


gi881564 


Homo sapiens 


Human zinc finger containing 
protein ZNF157 (ZNF157) mRNA, 
comolete cds 

VV/XXAL/XV W V/ • 


931 


47 


1230 


gi2351568 


Mus musculus 


N-RAP 


1530 


57 




glOVJD JOx. 


T-T r\ x\~\ r\ c qtm p»n c 
ITLLUllU odjJlCllo 


TT •sfmif^TiQ tnRNA fnr rifHiilin 


533 

S w* 


31 


1230 


gil205990 


Homo sapiens 


Human nebulin mRNA, partial cds. 


453 


34 


1231 


gil4250424 


Homo sapiens 


clone IMAGE:3869590, mRNA, 

pdiLlal CUb. 


2252 


99 


1231 


gil3491282 


Caenorhabditis 
eiegans 


contains similarity to Pfam family 
score=5L7, E=1.6e-ll, N=3 


273 


23 


1911 


gijZ>yyo / u 


uiciyosteiium 
discoideum 


TP FA 

x SSJT r\ 


1 12 

X X 


26 


1232 


gi5911990 


Homo sapiens 


mRNA; cDNA DKFZp434K1235 

ffrnm r l n ne DKF7r»4^4K 1 23 5 V 

\i-i will L-1VJ11C J-/JNJL jLjIJ'-vJ'-tS^ 1 a J *J J 5 

partial cds. 


667 


100 


1232 


gil 1095215 


Streptomyces 

■ i * • • 
1 1 bllJLL lCIld lo 


cytochrome P-450 


113 


21 


1232 


gi6691467 


Triticum 
aestivum 


AHM1 


123 


29 


1 9n 


rri 1 <A'7QC9C 

gllDU /yszo 


T-T /^1*V1 /"V o o 1 Qf \ C" 

xiomo sapiens 


rlrmi* 1VTGP- 1 070S 

IMAGE:3926284, mRNA, complete 


1 179 

x x i y 


100 


1233 


12224994 


Homo saniens 


mRNA: cDNA DKFZp564M0163 
(from clone DKFZp564M0163); 
complete cds. 


196 


28 


1233 


AAY73338 


Homo sapiens 


HTRM clone 2019742 protein 
sequence. 


194 


28 


1234 


gil2005904 


Homo sapiens 


AD036 mRNA, complete cds. 


751 


100 


1234 


gi2073099 


Phycomyces 
blakesleeanus 


chitin synthase 


81 


35 


1235 


gi4886503 


Homo sapiens 


mRNA; cDNA DKFZp564C0469 


1401 


98 
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/"from rlnnp TT^F7ViS6<dP046QV 

partial cds. 






1 OTC 

1Z JD 


rri 1 'IQ'IQ 1 1 H 
gll 1 1U 


ivius muscuius 


oimiiar 10 lopoisomerabe yuis^j jlx 
binding protein 






1235 


gi3845613 


Homo sapiens 


mRNA for DNA topoisomerase II 
DinQing pxoiein, uoiripxcie t/Ut>. 


204 


27 


1236 


gi9945010 


Mus musculus 


RING-finger protein MURF 


1778 


95 


1Z.50 


gll.510U.3oo 


xiomo sapiens 


rnKJ-NA lor JxiNJr ju gene ror ring 
finger protein 30. 


1 11 A 


yy 


1Z30 


A/\±>ziU4o 


xiomo sapiens 


xiuman nucleic acia-Dinaing proiem, 
NuABP-52. 


1 1AA 
1 /HH 




1 on 


gll iyUoUUU 


xiomo sapiens 


r>\~,L,-o corepressor snort isoiorm 
(BCOR) mRNA, complete cds; 
alternatively spiiceu. 


OA /Z 


1 nn 

1UU 


1237 


gil 1907998 


Homo sapiens 


BCL-6 corepressor (BCOR) mRNA, 
complete cqs, alternatively spncea. 


5246 


99 


1237 


gil3592175 


Leishmania 
major 


PPg3 


203 


24 


lzio 


gllzoDji /o 


Homo sapiens 


partial mKJNA tor oxoaicarooxyiate 
carrier ^uu^ genej. 


1 DDZ 


1UU 


1238 


gil2655649 


Rattus 

* 

norvegicus 


motochondrial oxodicarboxylate 

• 

carrier 


1300 


82 


1238 


gil 050774 


Saccharomyces 
cerevisiae 


YOR50-12 


491 


37 


1 O^Q 

IZoy 


rri 1 IClKICll ^ 

gllZUD jUZj 


xiomo sapiens 


mivlN/Y, CJJ1>IA 1^/XVJT /LpHj^ru / l^t 

(from clone DKFZp434P0714); 
complete cds. 




00 

yy 


LZjy 


fTI 1 ^I^AO 1 
gll 0ZO4Z 1 


ivius muscuius 


lv/\Jr j>xj 


07 

y i 




1239 


gi3645904 


Homo sapiens 


Human Smg GDS-associated protein 

OlVl/VlT IXLIVlN/\, UUIIipiCLC tllo. 


99 


25 


1240 


gi3811111 


Homo sapiens 


Human l(3)mbt protein homolog 
ixiiviN /\, complete cos. 


1445 


49 


1240 


AAY01069 


Homo sapiens 


Human l(3)mbt protein sequence. 


1445 


49 


1240 


gi5817146 


Homo sapiens 


mRNA; cDNA DKFZp586P1522 
^rrom clone UssJ? zjpjoor i dzzj, 
partial cds. 


935 


50 


1 OA 1 

1Z41 


gllU43 ly 15 


Homo sapiens 


clJJNA: rJLJzloUl tis, clone 
HEP00707. 


/^C 1 
DO 1 


oo 

yy 


1 1/1 1 

1241 


A A PAA7 /1 1 

* 


Homo sapiens 


Human secreted protem, bxi(j 1JD 
NO: 4424. 


111 

All 


OA 


1241 


gil2052965 


Homo sapiens 


mRNA; cDNA UisJ H ZpjooMl(J4o 
(from clone DKFZp566M1046); 
complete cus. 


lyo 


zy 


1242 


gil5029844 


Homo sapiens 


clone IMAGE:3451454, mRNA, 
partial cds. 


2315 


99 


1242 


gil4043625 


Homo sapiens 


clone IMAGE:3139971, mRNA, 

pal Licti uui. 


2303 


99 


1242 


gil3325200 


Homo sapiens 


clone IMAGE:3659680, mRNA, 
partial cds. 


1906 


99 


1243 


gil4330448 


Homo sapiens 


mRNA for zinc finger protein 
RINZF (RINZF gene). 


4285 


95 


1243 


gi4557143 


Rattus 
norvegicus 


zinc finger protein RIN ZF 


1668 


89 


1243 


gil 0434431 


Homo sapiens 


cDNA FLJ 12752 fis, clone 
NT2RP2001 174, weakly similar to 


1343 


100 
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GASTRULA ZINC FINGER 

t>t> OTTh TKF YT r^dTiA^ 1 
r\KLJ 1 xlliN JVJLL^Orr^O. 1 . 






1244 


AAG03897 


Homo sapiens 


Human secreted protein, SEQ ID 

"MTV 


551 


91 


1244 


gil5080686 


Lentinula edodes 


CDC5 


91 


26 


1 1 A A 

1244 


gll401U3j4 


rlomo sapiens 


tropomyosin 4-anapiastic lympnoma 
kinase fusion protein minor isoform 
mtvLNA, partial cus. 


OJ 


JU 


1z4j 


AA x J0yo4 


riomo sapiens 


Jtixienaea. numan becrcicLi piuiciii 

cpmipnrp ^sTnO TD "MO 911 


OJ7 


00 


1 OAS 


gllZ / 1 oZUo 


INeUXObpOla 

crassa 


icidlcu. LU Uocuuuui ivj.iiiC' oyiiiiiaoc; 


261 




1ZHO 




rbCUtlUIIlOIlab 
t\ pm cri n rwp 


tT?TsJA ncpiirlrMiriHiTiP cvntTifiQp 
IXVIN poCU.vJ.4JUilU.lJ.lC -J *J ojriiiiiciac 




41 


1246 


gi3025832 


Onchocerca 

vol vnln q 


pyrrolidone-rich antigen 


68 


41 


1246 


gi 15 077042 


Bos taurus 


survival motor neuron protein 


83 


32 


IZH-O 


criQ1 ^908 




ability 111UV/111 


91 


24 


1247 


gi55471 


Mus musculus 


Zfp-29 


1203 


52 


1Z4 / 




nomo sapiens 


zinc imger proicin z^rsr iou 
(ZNF180) mRNA, complete cds. 


1 1 00 




1 1 /I T 


gl!3o /yz4U 


JVlus museums 


oimiiar to zinc linger protein *to 


1 1 07 
i ±y / 


JJ 


1248 


AAB56403 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO:98 1 . 


813 


98 


1 <-> A O 

1248 


gi4992U4 


Drosophila 
melanogaster 


D-i3-A-D box protein 


4U / 


D 1 


1248 


gi/zo8333 


Arabia* op sis 
thaliana 


A i jT-aepenaent jkjna neiicase iiKe 
protein 


JO/ 


*XA 


1249 


gill096137 


Homo sapiens 


MAGEF1 (MAGEF1) mRNA, 
complete cds. 


1592 


100 


1249 


gil4603190 


TT„ „ • 

Homo sapiens 


MACjJbrl protein, clone 
MGC: 19617 IMAGE: 3 9433 84, 
mRNA, complete cds. 




i nn 
1UU 


1 <1 /(A 

1249 


AAB60476 


TT * 

Homo sapiens 


Human cell cycle and proliferation 

protein L^Ux r\K-Z4, oxiv^ ID INd.Z^f. 


l jyz 


1UU 


1250 


gil4043262 


Homo sapiens 


Similar to RIKEN cDNA 
i3uuuz0it>iij gene, cione 
MGC: 15737 IMAGE:3355622, 
rrix\j.N/\, complete luo. 


1076 


100 


1250 


gil4250512 


Homo sapiens 


clone MGC: 15468 
nviAijrjti.zyooyz i, mtciNA, complete 
cds. 


1076 


100 


1250 


gil2006223 


Homo sapiens 


NPD017 mRNA, complete cds. 


464 


48 


1251 


gil2652831 


TT „ • „ 

Homo sapiens 


clone MCjC:5z4z !MA(jrli:zy0UzUo, 
mKJNA, complete cas. 


00 / 


1UU 




A A "\/o ^ r\'~i r\ 

AAY87070 


Homo sapiens 


Human secreted pro tern sequence 
SFO TD NO- 109 


DOO 


QQ 


1251 


AAY87165 


Homo sapiens 


Human secreted protein sequence 
SEQ ID NO:204. 


566 


98 


1252 


gi6808293 


Homo sapiens 


mRNA; cDNA DKFZp434A1010 
(from clone DKFZp434A1010); 
partial cds. 


2951 


76 


1252 


gil5145797 


Sus scrofa 


basic proline-rich protein 


340 


31 


1252 


gi600118 


Zea mays 


extensin-like protein 


300 


26 


1253 


gil5139362 


Mus musculus 


aczonin 


6067 


96 
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gl / 


T? a f+n c 
XS.aL.lUo 

norvegicus 


mnltiHomnin "nrpQvnantip pvtomatHx 

jprotein Piccolo 


6046 


95 


1253 


gi6433844 


Gallus gallus 


aczonin 


5521 


87 


1Zj4 


giio loyjoz 


ivius muscuius 


■ 

aczonm 


\J\J\J ± 




1Zj4 


gi/4yjoJo 


Kattus 
norvcgiL/Ua 


muitiaomam presynaptic oyiuiiia.iiix 

f\TVi'f - P a iTi P i /"■<"* r\ 1 f\ 
pUJlClIl JT 






1254 


gi6433844 


Gallus gallus 


aczonin 


5455 


86 


1255 


gi409389 


Rattus 
norvegicus 


dC-stretch binding protein (CSBP) 


1793 


80 


1255 


gi241478 


human, mRNA, 
2302 nt]. [Homo 
sapiens 


heterogeneous nuclear 
ribonucleoprotein complex K 


1793 


80 


1255 


AAY39467 


Homo sapiens 


Heterogenous ribonuclear protein K. 


1793 


80 


1256 


AAW47589 


Homo sapiens 


T-cell receptor beta-chain. 


537 


91 


1256 


gil552495 


Homo sapiens 


Human germline T-cell receptor beta 
chain Dopamine-beta-hydroxylase- 
like, TRYl, TRY2, TRY3, 
TCRBV27S1P, 
TCRBV22S1A2N1T, 
TCRBV9S1A1T, 
TCRBV7S1A1N2T, 
TCRBV5S1A1T, TCRBV13S3, 
TCRBV6S7P, TCRBV7S3A2T, 
TCRBV13S2A1T, 
TCRBV9S2A2PT, 
TCRBV7S2A1N4T, 
TCRBV13S9/13S2A1T, 
TCRBV6S5A1N1, TCRBV30S1P, 
TCRBV31S1, TCRBV13S5, 
TCRBV6S1A1N1, TCRBV32S1P, 
TCRBV5S5P, TCRBVISIAINI, 
TCRBV12S2A1T, TCRBV21S1, 
1CRJ3V81S4P, ICKBVlzbi, 
TCRB V2 1 S3 A2N2T, TCRBV8S5P, 
lUKrJVlJol genes rrom bases i to 
267156 (section 1 of 3). 


534 


99 




glJOZlQO/ 


Homo sapiens 


l cell receptor beta cnain variable 
region (BV22S1A2N1) mRNA, 
partial cas. 


J-5Z 




1257 


gi8096269 


Nicotiana 
tabacum 


KED 


320 


25 


1257 


gi7549210 


Babesia 
i • 

bigemma 


200 kDa antigen p200 


333 


25 


1257 


gi6624128 


Homo sapiens 


PAC clone RP4-687K1 from 14, 
complete sequence. 


288 


28 


IZOo 


gllU44UUZ0 


riomo sapiens 


cuiNA. rLJzjjoj us, cione 
HEP 15507 


1 4^R 

It J o 


i no 


1258 


gil2805037 


Homo sapiens 


clone MGC:5306 IMAGE: 3460203, 
mRNA, complete cds. 


1438 


100 


1258 


AAB43628 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1073. 


1101 


99 


1259 


gi3435244 


Homo sapiens 


centriole associated protein CEP 1 10 
mRNA, complete cds. 


4915 


99 


1259 


gi485858 


Mus musculus 


IB3/5-polypeptide 


1728 


55 


1259 


gi9916 


Plasmodium 


liver stage antigen 


556 


24 



WO 03/029271 



PCT/US02/30474 



162 
Table 2A 



ID 
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AA1 

species 


JLJcSCl lpiIOIl 


score 


identity 






ia ic ip arum 








1ZOU 


gl /U 1 yo IJ 


nomo Sapiens 


CJ_/IN/\ J? 1_>J ZUUU.} IIS, ClOllC 

ADKA01794. 




52 


1 9^n 
1 zou 


gijo / ouz> i 


i^aenornaDaiiis 
elegans 


T-T9 1 PO^ 9 


1 79 


32 


1ZOU 


glOo ~> /Uj 


O aCCllai Ollly CCS 

cerevisiae 


OR T9 


158 


25 


1261 


gi 14043 196 


Homo sapiens 


clone IMAGE:3 162799, mRNA, 
partial cds. 


1247 


100 


1ZO 1 


glrfUO / jo 


nomo sapiens 


ri. sapiens onx> iiirviN.r\.. 


OUl 


46 


JLZOl 


AA I U /UhU 


xiomo sapiens 


Jtsreast cancer associated antigen 

piCCUloOl oCLj LlCJLxCC 


O \J X 


46 


1262 


gil0242353 


Homo sapiens 


pellino 2 (PELI2) mRNA, complete 

CU.S. 


2290 


100 


1262 


gil4550457 


Homo sapiens 


pellino (Drosophila) homolog 2, 
clone MGC: 15066 

cds. 


2290 


100 


JLZOZ 


m*1 0949^7 
gllUZnZ.j.J / 


1V1US IILUS CUIUS 


penmu z 


911^ 


99 


1263 


gi212657 


Gallus gallus 


smooth muscle caldesmon 


150 


36 


lZoi 


gi/ j4yziu 


rsaoesia 
bigemina 


zuu Kua antigen pzuu 


1 Al 


9^ 
zo 


iZoJ 


rri/^^QQ 1 "JO 

gitoyy iJo 


liomo sapiens 


mxviN/\, CUlN/\ JJJfsJrZjpHJH-lUJjO 

(from clone DKFZp434I036); partial 
cus. 




zo 


1264 


gi632549 


Petromyzon 

• 

iildl 1X1US 


NF-180 


247 


27 


1264 


gi9837381 


Mus musculus 


retinitis pigmentosa GTPase 

Tf* cm 1 n tr»f 
i c^uiaLVJi 


215 


33 


1 OfiA 


gl / V/JJ^T JU 


LJ1 CaJJlaaiiia 

parvum serovar 3 


iiiuiiLijpic uaiiucLi aiiLigcii 


211 


28 


1ZOJ 


gi / jyjjyj i 


noiliu Sapiciib 


LipiLULl ^HJWtll laClOl J.CCGJJIAJJ. 

mRNA, complete cds. 


R2 


28 


1265 


AAY92809 


Homo sapiens 


Human opioid growth factor 
receptor spliced version 8. . 


82 


28 




nil A n^Q^A 


ivius musculus 


in system amino acias rransponer 

XT AT 1 
1>I A 1-1 


70 


^9 


1266 


gil2751100 


Homo sapiens 


PNAS-127 mRNA, complete cds. 


380 


100 


1 9AA 


niA&A A 1 Q7 


/vraDiaopsis 
thaliana 


1 ZJxvo.D 


uo 


9R 


IZoo 


gii jyzj jo4 


iniiueiiza a virus 
(A/Hong 
Kong/5 14/97(H5 

Ml)) 


kjn A polymerase 


*J7 


97 
Z / 


1267 


gill61 1473 


Mus musculus 


Deltex3 


1858 


98 


1267 


AAY70439 


Homo sapiens 


Human JN otcn signalling protem, 


*\1C\ 
D /U 


A ^ 


1267 


AAY70440 


Homo sapiens 


Human Notch signalling protein, 
Deltex (hZDX)-4. 


370 


45 


1268 


gil4042300 


Homo sapiens 


cDNA FLJ14640 fis, clone 
NT2RP2001460, weakly similar to 
TRICHOHYALIN. 


2667 


99 


1268 


AAB95168 


Homo sapiens 


Human protein sequence SEQ ID 
NO:17221. 


2667 


99 


1268 


gil3278316 


Mus musculus 


Similar to RIKEN cDNA 


2182 


70 
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zoiuou/iao gene 






1269 


gi60 16842 


Mus musculus 


nuclear protein ZAP 


2869 


92 




AA Y /Z loo 


Homo sapiens 


Human JKJN A metabolism protein 
(RMEP-8). 


z /yo 


1 nn 
1UU 


lzoy 


glOO /3 /O 


Homo sapiens 


(clone zap3) mRJNA, 3 end. ot cds. 


1 /zo 


y / 


1270 


gil2803769 


Homo sapiens 


clone MGC:3411 IMAGE: 3 62 9947, 
mRNA, complete cds. 


660 


100 


1270 


gi4689250 


Homo sapiens 


sorting nexin 5 (SNX5) mRNA, 
complete cds. 


92 


95 


1270 


gi7023288 


Homo sapiens 


cDNA FLJ10931 fis, clone 
UVAKC 1 (JUUD04. 


92 


95 


1271 


AAG00714 


Homo sapiens 


Human secreted protein, SEQ ID 
JNU: 4 /yj. 


527 


99 


1271 


gil 732239 


Mycoplasma 
iermentans 


monocytic differentiation/ activation 
iactor 


87 


20 


1271 


g i4324459 


Schizosaccharom 
yces pomoe 


caffeine-induced death protein 2 


78 


22 


1272 


gi2414626 


Schizosaccharom 
yces pomoe 


ubiquitin regulatory domain (UBX) 
protein 


368 


30 


1272 


gi9 14992 


Saccharomyces 
cerevisiae 


Ydr330wp 


314 


35 


12 II 


gi7527718 


Arabidopsis 
thaliana 


T5E21.7 


341 


37 


1273 


gi4263747 


Homo sapiens 


PAC clone RP4-771P4 from 
7qll.21-qll.23, complete sequence. 


729 


79 


1273 


gi2827180 


Homo sapiens 


general transcription factor 2-1 
(GTF2I) mRNA, complete cds. 


729 


79 


1273 


gi2827203 


Homo sapiens 


general transcription factor 2-1 
(GTF2I) mRNA, alternatively 
spliced product, complete cds. 


729 


79 


1274 


gil4042066 


TT * 

Homo sapiens 


cDNA FLJ 14503 fis, clone 
NT2RM 1000252, weakly similar to 
H.sapiens E-MAP-1 15 mRNA. 


1397 


75 


1 1*7/1 

1274 


AAB94010 


Homo sapiens 


TT j r~l 1 — i / — \ T-r\ 

Human protein sequence SEQ ID 
NO:14130. 


1397 


r—i r- 

75 


1 T7/1 
1Z /4 


gi / jo \y<$ j 


Homo sapiens 


rnKJNA tor E-MAr-1 15/105 (MAr 
gene). 


421 


34 


1275 


gil0439810 


Homo sapiens 


cDNA: FLJ23201 fis, clone 

J^AIAjoo / Z. 


1349 


99 


1275 


gil3097606 


Homo sapiens 


clone MGC: 10765 
lMA(jrli:3oUoi41, mRNA, complete 

COS. 


927 


97 


1Z /O 


gi44y:>uyj 


jvius museums 


erytmoia cell-specilic ana testis- 
specific protein l(ERT-l) 




3o 


1 

1Z /O ; 


glOUZ43o 


jdos taurus 


pno spnoprotern 


you 


n i 

y i 


1276 


gil 043 6906 


Homo sapiens 


cDNA: FLJ20940 fis, clone 

/\UoI3U 1 OUO. 


886 


100 


1276 


gi7243755 


Homo sapiens 


neuronal phosphoprotein DARPP-32 
mRNA, partial cds. 


508 


100 


1277 


gi3811376 


Mus musculus 


Bingl 


350 


37 


1277 


gi4050099 


Mus musculus 


BING1 


350 


37 


1277 


gi2952308 


Homo sapiens 


zinc finger transcription factor 
(ZNF-X) mRNA, complete cds. 


314 


48 


1278 


gil4042529 


Homo sapiens 


cDNA FLJ 14768 fis, clone 
NT2RP3004125, moderately similar 


2742 


99 
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atUl c 


Percent 








to Mus musculus zinc finger protein 
splice variant i/iZvi-ij ^jtizi j iijjcvin^y. 






1278 


AAB95788 


Homo sapiens 


Human protein sequence SEQ ID 

"MTV 1 874^ 
1NU. 1 O / H~D t 


2742 


99 


1278 


gi5579307 


Mus musculus 


zinc finger protein splice variant 

W71 A 


2323 


86 


1279 


gil5162128 


Agrobacterium 
lumeiac lens 


AGR_pAT 493p 


90 


34 


1279 


gil016116 


Cyanophora 
paraaoxa 


ribosomal protein LI 


81 


25 


lZoU 


rri 1 C\A 1A 1 ^7 


rionio sapiens 


^TYNTA T7T T1 9^89 fic n*\r\-ne* 

cl/in/v rx^tj izooz us, cione 
NT2RM4001151. 


R1 1 
oil 


QQ 


1280 


AAB94144 


Homo sapiens 


Human protein sequence SEQ ID 

■ 1 A A 1 a 


811 


99 


1280 


gil2653823 


Homo sapiens 


clone MGG3101 IMAGE:3350198, 
mRNA, complete cds. 


763 


100 


1 1 O 1 

lzol 


A ADCOOA1 

AABDioUl 


Homo sapiens 


Human colon cancer antigen pro tern 
sequence SEQ ID NO:134L 


3U / 


yD 




A A ID HO 1 K 

AArsyy 1 lo 


Homo sapiens 


rluman protein o-bt^J iu 14. 


1 1 
/ 1 


JZ 


lzol 


A A X1&1A <A 

AAJdo34j>4 


rlomo sapiens 


rluman ore as t cancer associated 
antigen protein sequence SEQ ID 

1NU.O ID. 




DZ 


1282 


gi 14042423 


Homo sapiens 


cDNA FU14714 fis, clone 

in izxsJr->uui iv/, weaKiy similar 10 

PEREGRIN. 


2618 


99 


1 9R9 
IZoZ 


x\i\£jy j\J¥D 


jnomo sapiens 


fiuniaii proiem bequenL/c oxiv*/ iu 




0Q 

yy 


1 989 

1Z.OZ 


ai6Q7Q01 0 
givjy / zs\j i y 


imjlliw aajJiciio 




5?7 


~> ~j 


1 9K^ 


cril 04^9849 

glll/'+jZOT-^ 


JTlvJlllW odJJlCllo 


pH>JA FT T1 1^61 fi<; rloTie 

HEMBA1003142. 




1 00 


1 9K^ 


A ARQ4Q97 


11VJ111W aajJlC/llo 


XlU.lllu.ll jJl VJ LClll oCHUCllL/C OJ-jy ll_y 

NO: 16392. 




1 00 

1 WW 


1 98^ 

1 ZrOJ 


A API 0R71 


JTHJIIIU bopiCllb 


XjAUII 111 CllL-UClCU. Uy gCHAJllll^/ lllC^ - 

CSF clone. 






1 984 

1 ZOH 


ail 7S96^R 


XlOniO bdpiCIlb 


lIllviN/\ 1U1 IV 1 I'+j UUllipiCLC ULlb. 


U J w 


1 on 

1 WW 


1284 


gi2769585 


Mus musculus 


GCN5L1 protein 


603 


96 


1Z54 


gllUoLo 30 1 


<^iona mtestmans 


nuclear lamin 


5Z 


97 


1285 


-AAY77471 


Homo sapiens 


Human deubiquitinating protein 

T\,,t«1 1 OTTO TFV XTO«^/l 

JJUD1 1, oxiV^ 1U 1N^J.j4. 


1273 


93 


1285 


AAW30711 


Homo sapiens 


Human ubiquitin-specific thiol 
protease jjud Ujoj /o. 


1229 


94 


1285 


AAY77470 


Homo sapiens 


Human deubiquitinating protein 
Dubll, SEQ ID NO:32. 


1234 


94 


1286 


gil0439995 


Homo sapiens 


cDNA: FLJ23342 fis, clone 

TIT "CD nr\ 


2647 


99 


1286 


gil2224847 


Homo sapiens 


mRNA; cDNA DKFZp667A213 
^iiom cione LJi\.r zjpoo / J\£ id ). 


1743 


100 


1286 


gil3561988 


Argiope 
trifasciata 


major ampullate spidroin 2-like 
protein 


123 


42 


1287 


gil 5080040 


Homo sapiens 


Similar to RIKEN cDNA 
231 0008 J22 gene, clone 
MGC:19531 IMAGE:4336762, 
mRNA, complete cds. 


708 


46 


1287 


AAB95390 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17738. 


703 


53 
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1287 


gi6599183 


T T * 

Homo sapiens 


T> "N.T A TATvT A TVT/"C"7^ A ^1 A T~> 1 C 1 /I 

mRNA; cDNA DKFZp434PlM4 
(from clone DKFZp434P1514); 
partial cds. 


Do / 


AO 

4y 


1288 


AAB95390 


Homo sapiens 


Human protein sequence SEQ ID 

"MY*"*!. 1 HHI Q 

JNU: 1 / /3o. 


/U3 




1288 


gil 5080040 


Homo sapiens 


Similar to RIKEN cDNA 
231 0008 J22 gene, clone 

JVltjC:iyj31 lMA<jrxi.4J^o /oz, 

mRNA, complete cds. 


637 


50 


1285 




Homo sapiens 


„._T) "NT A • «n\T A TMTX77'i-»/1 1/1*01 < 1 A 

nilvJN A; CJJJNA JLJJvrZ/p4^4r'lD 14 

^rrom cione uisjrz^p^f jh-it ij ihj, 
partial cds. 


All 




1 0 CO 




jrsorriDyx mon 


orniuune QcCdrDoxyiabc aniizyinc 


/ J 




1289 


gi331156 


Human T-cell 
lymphotropic 
virus type i 


env polyprotein (partial) 


51 


45 


1289 


gi8980338 


Takifugu 
ruDnpes 


FRANK2 protein 


76 


48 




gllZ /DlDy I 


Jtiomo sapiens 


xiVvjri niKiNA, complete cas. 


1 1 c\& 

1 1UO 


1 uu 


1290 


AAB64938 


Homo sapiens 


Human secreted protein sequence 
encoaea oy gene d oJbv^i iu jnvj. i 10. 


87 


33 


1 TOA 

129U 


AAis 04949 


Homo sapiens 


riuman secrete a protein sequence 
encoded by gene 10 SEQ ID 

1NV^. 1Z / . 


O / 


3D 


1291 


gil263289 


Araneus 
diadematus 


flbroin-4 


146 


26 




AA Y VOIZj 


nomo sapiens 


*oOiiagen type 111 aipna- 1 . 


1 AO 


97 


1291 


gi930045 


Homo sapiens 


Human COL3A1 mRNA for pro 
aipna-i ^nij conagen. 


142 


27 


1292 


gi915208 


Sus scrofa 


gastric mucin 


287 


25 


1 TOO 

izyz 


gll3D9zl /D 


Leishmania 
majoi 


PPS 3 


ZyZ 


Of* 
ZO 


1292 


gi557822 


Saccharomyces 

■ • 

cerevisiae 


mal5, stal, len: 1367, CAI: 0.3, 

A1VI xxl I JQ/-VO I JrUOO^+U 

GLUCOAMYLASE SI (EC 3.2.1.3) 


269 


24 


1 9Q1 




xiomo bctpiens 


"CJi7t-vio-n r*\rr*'\c* t^TV\1"i"f"p*f citizen 

numdn ceil cycie dnu proiiieiaiiun 
protein CCYPR-6, SEQ ID NO: 6. 


"t?Z_> 


i on 


1293 


gi8920230 


Homo sapiens 


partial mRNA for Spir-1 protein 
(bpir-1 gene). 


385 


42 


1293 


gi4107015 


Ciona savignyi 


PEM-5 


286 


39 


1294 


glli548492 


Caenornabaitis 
elegans 


ZC41U. /b 


3 j4 


A A 
44 


1294 


gi4996286 


Arabidopsis 
thaliana 


lipoyltransferase 


343 


40 


1294 


gi2494127 


Arabidopsis 
thaliana 


Contains similarity to 
Mycobacterium LIPB gene 

V& u | v < AUHUt l J. 


343 


40 


1295 


gil81097 


Homo sapiens 


Human gamma- A-crystallin gene 
(gamma-G5), exon 3. 


968 


99 


1295 


gi387135 


Mus musculus 


gamma- A-crystallin 


845 


84 


1295 


gi203627 


Rattus 
norvegicus 


gamma- A-crystallin 


837 


83 


1296 


AAG02474 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6555. 


266 


96 


1296 


gil0518509 


Streptomyces 


B-N-acetylhexosaminidase 


69 


33 
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plicatus 








1296 


gi7544050 


Streptomyces 

pnplipolrvr A1^9^ 


beta-N-acetylhexosamimdase 


69 


30 


1297 


gi7021900 


Homo sapiens 


cDNA FLJ10065 fis, clone 
HEMBA1001455. 


981 


38 


1297 


AAB92496 


Homo sapiens 


Human protein sequence SEQ ID 

IN w. IUJ7O. 


981 


38 


1297 


gi6581093 


Mus musculus 


transposase-like protein 


924 


36 


1 OQQ 

lZyo 


gll/UOUoZU 


riomo sapiens 


serologically ueimou urcdbi uaxxucx 
antigen NY-BR- 15 rnRNA, partial 
cds. 


777^ 


1 00 


1 OQQ 


rri 1 OzLl AlOl 


riomo sapiens 


pTYNJA FT T1 1076 fk rlnnp 
civ in f\ r i-i j x d y / u xxb, cxdxxc 

Y79AA1001594, weakly similar to 
HYAT T JRONAN-MEDIATED 
MOTILITY RECEPTOR. 


O ~J o 


37 


1 9 OR 




xxuxxiu bdpiciib 


Human ■m*otpm Qprmpnpp SIT^O TTj 

11U.111u.11 piVJLClll 5Cl[UUHtC Ol/y 11> 

NO: 18344. 


858 


37 


1 900 


ail 90S1000 


11U111U bttpiUllb 


rnRNA- cDNA DKFZr>434A171 

llli-Vl^Xi., \jU 1. > £\. 1/xVJ. IJ • »J • ri X / jl 

(from clone DKFZp434A171); 

nomnl ete cds 

vv/lllk/lv Lv vUu • 


907 


66 


1299 


AAB28628 


Homo sapiens 


Human Bl lAgl antigen splice 
isoformBllC-15 


596 


49 


1299 


AAB28629 


Hnmo ^aniens: 


Human Bl lAffl antiffen stxlice 

XXtiiJklVill J— ' J. X X £3 X UllUUVXX kJL/lXW 

isoformBHC-8. 


596 


49 


1300 


gil2804657 


Homo sapiens 


clone IMAGE: 3354845, rnRNA, 

nartial cds 

U(U Udl CIlJ . 


914 


100 


1300 


AAG01406 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5487. 


421 


100 


1 ^nn 


A A nOORJ.0 


xTLvjUXXU bapXCXXa 


T-Tnirmn QppTpfpH trrntpin Sl"Pf"j TTj 

XlUlllall dC^/lClCU piULClll, OlvV^ 1U 

NO: 4921. 


92 


28 


i ^ni 

1 jU jl 


ail 11 1 1 4R0 


XXUXXXU isapXCXXo 


ml?TSJA "frvr TsJpcVi"R iP p nmn 1 P t P pHq 

1 1 IXvl > jr\ XLIX INCoXXXjJT, L/L!XX1|JXC/LC/ 


21 67 


84 


1301 


gi5912261 


Homo sapiens 


rnRNA; cDNA DKFZp586L2024 

\^li VJlll U1LJ11C JJIVTZjJJ JOUl/^ v/.'t^ j 

partial cds. 


1995 


99 


1301 


AAE01791 


Homo sapiens 


Human gene 22 encoded secreted 

piULClIl nWD.LT UVJ, OXZ/V^/ XXV 

NO: 112. 


970 


70 




A A V1A1 90 

J\t\. I D^t 1Z7 


XJ.UJ.I.UJ bap 1C lib 


T-Tiimcin nntacciiim fh^tmpl 
XTX.U.1X1CIXX ULlLdooXlAlll Olldllllt'l 

K+Hnov28. 


1 92 


34 


1 109 


A AU0S901 


XXL/XXHJ odpXCXXo 


TTiiman nrntpin cpnnpnpp SIFO 1 V) 
XTXUIXICIXX UXLILC11X &vL[LiCll^'C' ul/y xx-' 

NO: 17295. 


192 


34 

— j r 


1 109 


A A 71 1 007 

aal 


XXUXXIU oapiCIlo 


T-TiiTnciTi nntiiooiinn piiottppI 
XTXLXlXXcllX pULd.bblU.lll L/lXctXlXld 

K+Hnov28 cDNA (5' splice variant 
1). 


192 


34 


1 101 


A A*R9R040 


xthjxxiu oapicxxo 


T-TnTYmTi cppT*p1*pr? TM*r>tpin SI H.f) 1 Yj 

1 X 111 1 lull oCLICICU piULClll ul/y XJ— ' 

NO: 97. 


204 


100 


1101 

1 -J \J —J 


pi881 171 


cerevisiae 


ORP* M^etliod* c oncen tual 
translation supplied by author 


71 


34 


1303 


gi927780 


Saccharomyces 
cerevisiae 


Ydr511wp; CAI: 0.13 


71 


34 


1304 


AAB50651 


Homo sapiens 


Human UNC-5H1 protein 
translation in frame 3 sequence SEQ 
EDNO:10. 


93 


32 


1304 


gi2232272 


Schizophyllum 
commune 


B2-aldehyde-fonriing enzyme 


85 


30 
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1304 


gi9800242 


rat 

CyiO lllc g al o v lrus 

Maastricht 


pr5 


90 


30 


IjUj 






rDNA- FT T?^04? fis clone 
LNG02323 


955 

~r w' S 


100 


1305 


gil041087 


Mus musculus 


synapsin II 


85 


32 






rioirio bapienis 


IIlClllUIClllC Clo&L/L'Kll.CU. JVXLidaw ^JLVJ-JfLX ^ 

mRNA, complete cds. 


96 


27 


1 1C\& 
IDVV 


rr^ 1 1 /17A1 1 

gil 14 /Ol 1 


oaccnaroixiyces 
cerevisiae 


T rvRr* 
x^pxop 


136 


28 




gii^yzu /4 


ivie uiano co ecus 
jannaschii 


LiviN r\ llxlXUIl CIlUHJi.lU.L'XCcloC ^CllLLrt. ^ 




32 


1306 


gi3093358 


Mus musculus 


SPR2A protein 


74 


40 


1307 


gil 1225489 


Homo sapiens 


8q22.1 region and MTG8 
^ljlj.t/\zi i ^ gene, pdiiiai tut». 


94 


57 


1307 


gi2981446 


Homo sapiens 


MTG8-related protein (MTGR2) 
gene, exon iu ana paniai cas. 


91 


53 


1307 


gil0437096 


Homo sapiens 


cDNA: FLJ21080 fis, clone 

C A QH9/1/1Q 
L^AoUZ44y . 


108 


51 


1308 


gil0433656 


Homo sapiens 


cDNA FLJ12229 fis, clone 
iVi/\.iviivi/\ luuiioi, weaKiy biirxiiai 
to ABC1 PROTEIN HOMOLOG 
PRECURSOR. 


813 


95 


1308 


AAB93936 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13939. 


813 


95 


1 O AO 

1308 


gil l jyjo43 


JNeurospora 
crassa 


prooaoie aoc i proiein precursor 


IH-t 


L T\J 


1309 


AAW30650 


Homo sapiens 


Human secreted protein clone bg249 
1 protein. 


514 


36 


1 1 aa 
1 J09 


■ n ono Am 

glio /o(Jy3 


i^aenoma d cutis 
elegans 


ormiiariry wim arosomia ivioir-ouvj 
protein (PIR acc. no. S30431), 
contains similarity to Pfam domain: 

Score=90.2, E-value=1.4e-23, N=l 


1 AS 
XH J 




1 1 AO 


gl4DoZ0 / 1 


vjraiius gaiius 


xiypenon pro tern, ^+ 1 y jsaj lauiumi 


1 SO 


97 


1 1 1 A 

lo IU 


'A CO A 1 O /I 

gl4oo4134 


riomo sapiens 


lTLtvlNx\, CUlN/\ JJ j\T £jp J O DELu J 1 i' 

(from clone DKFZp586H0519); 








gllZ / ZO /\JD 


j_*acxococcuo 
lactis subsp. 

IdL- Llo 


xx x x \J x xx xl x xv_^r\.x-r rivu x j—'Xxn 


61 


34 


1 T1 A 

13 1U 


•A A V70.7 1 & 
J\J\ x / UZ 1 0 


luomo sapiens 


Pqt+iciI In i m q ti TTrff^i-lf^nlriTi.— 1 f^t^QllAn 
raulal IlLiillall XlllCXlClXJSJXl -X vpolXUXA 

protein. 


58 


35 


1311 


gi3132823 


Bos taurus 


ribosomal protein L30 


794 


100 


1311 


gi5 60493 


Rattus 
norvegicus 


ribosomal protein L24 


794 


100 


1^11 

IjU 


gi izoUjzoy 


A/lnc "mil c i 1 n c 


ti'Vi ac ATn3 1 nrntpm T 0 A 


794 


100 


1312 


gil0437683 


Homo sapiens 


cDNA: FLJ21562 fis, clone 
COL06420. 


2858 


99 


1312 


gi7899288 


Homo sapiens 


B2 gene partial cDNA, clone B2E. 


398 


34 


1312 


gi2076894 


Caenorhabditis 
elegans 


short region of weak similarity to 
protein kinase C; contains similarity 
to Pfam domain PF00130 
(DAG PE-bind), Score=10.0, E- 
value=0.0034, N=l 


156 


36 
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1313 


gil2053069 


Homo sapiens 


mRNA; cDNA DKFZp434J0617 

*-vi n i A „« TWTl T 7-r\A'\A THAI *7V 

(irom clone ujsj?zp4o4juoi / 
complete cds. 


1071 


100 


1313 


gi296594 


Hordeum 
vulgare 


pZ±i4U 


07 


^5 

jj 


1313 


AAY84592 


TT ■ 

Homo sapiens 


Amino acid sequennce of a human 
artemm polypeptide. 


1 1 A 

1 1 0 




1314 


gi6599232 


Homo sapiens 


mRNA; cDNA DKFZp434M0728 
^irom cione jjjsJrzjpHDHLvi\j / 
partial cds. 


1788 


99 


ill/I 


rrl 1 A/l 'JOT 1 ^ 

gllU43y / 1 J 


riomo sapiens 


r«rYMA- T7T T9^19Q fic r>1or>p 




100 


1314 


gi927637 


Anthocidaris 

era coicnma 
L-Idbblbpillct 


dynein intermediate chain 2 


607 


34 


1315 


gill611545 


Sus scrofa 


aldose 1-epimerase 


1670 


89 


1 "2 1 <5 

id ID 


gl!34ZZ / /y 


v^auio uac ter 
crescentus 


alUUoc 1 -cpiIllCIa.aC 


758 


45 


1 1 1 c 

13 1 j 


•noo/i /l no 

giyzy44yo 


AraDiaopsis 
thaliana 


amose i -epimerdsc"- uk.c proLciii 


745 


45 


1 oi r 

13 lo 


gijy 1 /OOO 


z,ea mays 


exiensm-iiKe proiem 


1 54 


95 


1316 


gi 1749842 


Yarrowia 
lipolytica 


cell wall protein 


123 


35 


111^ 
1316 


• o 1 ri /l 

gio 163634 


Streptococcus 
pneumoniae 


suriace piotem x'spi^ 


1 9*; 


9^ 


1317 


AACjU1757 


Homo sapiens 


Human secretea protein, oiit^J id 

TvTfV 

INL/, jojo. 


o 1Z 


QQ 


1317 


gil747 


Oryctolagus 
cumcuius 


trichohyalin 


238 


26 


1317 


gil3559600 


Caenorhabditis 
elegans 


short region of weak similarity to 
x^iasnioaiuiii yociii rnoptry protem 
(PIR:C4552l) 


218 


24 


1318 


gil0438135 


Homo sapiens 


cDNA: FLJ21924 fis, clone 

Xll^r \JH U o O . 


504 


57 


1318 


gil5l45793 


Sus scrofa 


basic proline-rich protein 


317 


38 


111 o 

131o 


gloj2o34 / 


v olvox carten i. 
nagariensis 


nyaroxypi onne-ncn giycopro iem 
DZ-HRGP 


jOj 


j i 


13 iy 


gll434oDoo 


Homo sapiens 


J5w±c/\r5 zrnc linger pro iem ^rvxv loj 
mRNA, partial cds. 


1 960 


60 


1319 


gi7023703 


Homo sapiens 


cDNA FLJ11191 fis, clone 
j^jL/VL/Xiiuu /Dyo, weaKiy similar 10 
ZINC FINGER PROTEIN 184. 


1389 


62 


1319 


AAB93576 


Homo sapiens 


Human protein sequence SEQ ID 

"MTV 1 1QQQ 


1389 


62 


1320 


gi6467391 


Homo sapiens 


TONDU (TONDU) mRNA, 
complete cas. 


160 


37 


1320 


gil2652601 


Homo sapiens 


TONDU, clone MGC2032 
TMAOF-^S04^?7 mRNA rnmnlete 

cds. 


160 


37 


1320 


gil3097189 


Homo sapiens 


TONDU, clone MGC:5266 
IMAGE:2900293, mRNA, complete 
cds. 


160 


37 


1321 


gil0435586 


Homo sapiens 


cDNA FLJ13544 fis, clone 
PLACE1006815. 


1399 


100 


1321 


AAB95456 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17927. 


1399 


100 



WO 03/029271 



PCT/US02/30474 



169 
Table 2A 





rill IJLl 




Tlocnr'infiAti 
JL/CaCI ipilUll 


c 

score 


identity 


1321 


gi4761481 


Homo sapiens 


isolate 27 immunoglobulin lambda 

Holii" pViain varianlp rpcHon ( TCtT . l 

1JLK,11L wllCllll VUllUUit> X\sflX\JXX I 1VJ1-/1 

gene, partial cds. 


82 


36 


1 399 


m 1043 5989 

til X \_/*-r«J —J 7 0^-< 


TTnmn cani' pti c 

XXUIXXU OdL/lCXlO 


cDNAFLT13841 fis clone 
THYRO1000787. 


3838 

\j - 1 \j 


99 


1 399 


A ARQ4774 


XXvJlllU odLJIClio 


T-TnTnan nrofpin <ipmipnpp S1FO TT) 

XXUXIXdll JJXUlAvXlX o UU UvllwVj k ' \—i IX/ 

NO* 15862 


3838 

»J \J *-t o 


99 


1322 


pi 143 3 6702 


Homo ^atiien*? 


1 6n 13 3 sea uence section 3 of 8 


283 

\ m J m~/ 


26 


1323 


gi9885296 


Homo sapiens 


LENG1 protein (LENG1) mRNA, 
partial cds. 


1056 


100 




/A./\J3 J4 1 OH* 


T-T*""\TYH""V OOTMPMC 

nuiliu odpiCIlb 


XXU-ilXdll Udllv^lCdLlL/ L/dlXW'C'l diXLX^C/Xi 

nrntpin Qpnupnrp SPO Tl~# TvJO*61 


R23 


98 


1323 


AAB43890 


Homo sapiens 


Human cancer associated protein 
<?ennence SFO TD N01335 


102 


24 


X —J 


ai57720 


P a Hi i q ra tin i q 


ribosomal nrotein S20 fAA 1-1 19 1 


331 


100 


1324 


gil5030141 


Mus musculus 


Similar to ribosomal protein S20 


331 


100 




cri 1 3Q601 33 

X.J7\JVX.7_) 


XXU1XJJJ DdL/XCXIO 


TilioGr\TVi5i1 Tirof pin £190 plrmp 

XXUU&UllldX IJJL VJ tt/ All JiiUj vl*JXXt> 

MGC:4151 IMAGE:3029762, 
mRNA comnlete cds 


331 

— ' — ' X 


100 


1325 


gil345408 


Mus musculus 


AT motif-binding factor 


453 


40 


1 395 


ai3Q94679 


LLKJLLlKJ odjJlCIlb 


PAP rlnnp PPS-QQ1G90 rnmnlete 

X7 J\X J 77 1 VJi.Uj C'VJXXXLylt' Ut/ 

sequence. 


443 


40 


1 395 


A AP93Q63 


Hnmn canipn c 
XXKJXxlxJ odkJlCllo 


AFP-1 ^Ala 2460 Val i 

ill x x ^jTaici jL,^ v ax j. 


443 


40 


1326 


gil2002024 


Homo sapiens 


brain my 03 8 protein mRNA, 

^^JlXllJlt/lw IslXo. 


371 


100 


1326 


gil0178219 


Arabidopsis 

tli n K si n ?i 

UldAldXld 


DNA-binding protein-like 


100 


41 


1326 


gi433638 


Saccharomyces 

pp-rpvi gi ja p 

1/&X V/ V XO iuU 


D441 


93 


36 


1327 


gi6566147 


Drosophila 

mpl an n era ctpr 


large Forked protein 


200 


32 


1327 


gi549986 


Pennisetum 

r'lli off* 


possible apospory-associated protein 


156 


34 


1 397 


A AO09fiR9 


XXVJIaIU odlJIClio 


1-Tnm!nn cpfrptpH nrr\i"piii ^1Th(^ TT^) 

JLJ.lXXJ.idII bCL/XCLCLl yJxKJlCxlXy Ol^y XX-7 

NO: 6763. 


1 56 


33 


1 ^9R 


gi I U'tjjoo / 


T-T/"\T'YH'"V OQTMPHC 

XlOIIlL) bdpiCllb 


pTYWA FT T1^7SS fk rlnnp 

PLACE3000363. 


67R9 


00 


1 ^9R 


A ARQ4745 


T-Tr\mr» ociniPTic 

rxuiiiu adpicxib 


l-TurnQn nrAfpin cpniipnpp QTh TTj 
XXlXlIldll piULClIl bCL[U.CIlL/C OXjV^ XXV 

NO: 15792. 


67R9 




1 39£ 


ail 3977579 


1-Inmn csitm'ptic 

xxuixiij odjjiciib 


XXVX^VVJX-/. J'-T J X -5^5 IXXXVXN^Vj 

partial cds. 


2247 


100 

X V/v/ 


1 390 


cti 1 43366QQ 


XXDIIHJ bdJJltllb 


1 1 *\ ^ c<=»rmp"nr*p c<=»ptir»n 9 nf S 
lUpi J.J bCLjUCJULC oCULlUXl Ul O. 


832 

O J *w 


51 

J X 


1329 


gi2961133 


Drosophila 

m pin ft (~\ CT C\ C 1 PT 

lllCldliUgdo LC-I 


nuclear fallout 


338 


28 


1329 


gil353761 


Naegleria 
jlo wien 


myosin II heavy chain 


193 


26 


1330 


ail 5081398 


Hnmn saniens 

AAVJAAAW OH^/lwiiO 


Vnitrnftl-like 7\t\c. fiti O'er nrotein 

JVA U-Uk/vl liAVv £jXXX\y lALlgvl UlUlvJUl 

(ZNF300) mRNA, complete cds. 


1901 


58 


1330 


gi 1020 145 


Homo sapiens 


Human DNA binding protein 
(HPF2) mRNA, complete cds. 


1469 


57 


1330 


gi5679576 


Homo sapiens 


mRNA for zinc fmger 41 (ZNF41 
gene). 


1596 


47 


1331 


gil2804013 


Homo sapiens 


clone MGQ4179 IMAGE: 3 63 8050, 
mRNA, complete cds. 


1351 


99 


1331 


gi!3938317 


Homo sapiens 


Similar to zinc finger protein 202, 


389 


69 
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S 
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identity 








clone MGC: 15660 

TM A OF -^147^1 1 mRNA rnmnlete, 

cds. 






Ijj 1 


gljUUOZ j 1 


riOIHO bapiCIlb 


PAP Hone "RP4-604GS from 7a22- 

JTjtVVw' ^/lUliG J\r t t"UU'TvJ J 11 Will / \^t~£* 

q31.1, complete sequence. 


406 


43 


1 ^9 


gijoy jZ 


iValLUb 

norvegicus 


t\ii1" •tTrprvrs+i p fpern mtnrv ctr\Y—s 

UUl. UiCUpLiL' ICgU.Ia.lWl jr laUlUl"/. 


381 


98 


1332 


gil2052818 


Homo sapiens 


mRNA; cDNA DKFZp564B1162 

ffrnm rlnne r>KF7r>^64R1 1 62V 

complete cds. 


232 


34 


1 ^9 


giozzo / Ol 


jj" io iy u a ten uiii 

discoideum 


place \/TT nnrmiA/pnti nun 1 rnvfiQlTi 

Uldoo V 11 LillL/Dll V ClltlUllCll lliyUoXll 


201 


^8 


1333 


gi9309467 


Mus musculus 


leucine-rich glioma-inactivated 1 

TYmtpin nTPPi lfQni" 

piVJlClll piCl/UloUl 


1128 


46 


1333 


gi4091819 


Homo sapiens 


leucine-rich glioma-inactivated 

-f-\-f-/-\-fa-i ri tTrpnifcnr ( T (t-T 1 I tnT< MA 
piULClll piC^UloUl ^JL/VJXl j llilvl > .fA., 

complete cds. 


1121 


46 


1 


/\/\ y jozyy 


jciomo sapiens 


TTnirmn cf a r*TP»i"f a r1 tYmfpiTi F'nrTinf^H "n\7 
JTlUIllclll oCtiClCU piULClll \Jy 

frpnp 7tf^ 
gCUC / \J. 


1111 
iiii 


45 
™ j 


1334 


g il0434406 


Homo sapiens 


cDNA FLJ12735 fis, clone 

TSJT9PP90009SR wealdv similar to 

IN 1 .Z, XVJT JO, VY Ctilvl V O 111 111 £4.1 

ACTIVATOR 1 140 KD SUBUNIT. 


4071 


100 


1334 


AAB94270 


Homo sapiens 


Human protein sequence SEQ ID 
TsTO- 14689 


4071 


100 


1334 


gil67835 


Dictyostelium 
discoideum 


myosin heavy chain 


233 


19 


1335 


gil 1493544 


Homo sapiens 


PRO 1460 


412 


100 




glOio4o4o 


jtiomo sapiens 


rransienrin recepior ^iriv^; gene, 
complete cds. 


77 


jy 


1 


gllzOj4oy / 


xiomo sapiens 


ixansieiTin recepior k^lj i i 
clone MGC:3 151 IMAGE:3354176, 
rriRNA, complete cds. 


77 


j^ 


1336 


gil 5 079491 


Homo sapiens 


Similar to RIKEN cDNA 

1 90001 4T-T1 /l ctpiip rlnnp 

MGC:20255 IMAGE :465 1484, 

llilVJ. N A, OvylllLflC Lt/ CVJLo . 


1805 


100 


1336 


gil3879544 


Mus musculus 


Similar to RIKEN cDNA 

1 90001 4.IT 14 crpnp 


934 


88 


1336 


gil4043175 


Homo sapiens 


Similar to RIKEN cDNA 

1 90001 1A opTip rlnnp 

iZuUuiT , rii < T gene, oiuiie 
IMAGE:3 139657, mRNA, partial 

LUo. 


920 


100 


1337 


AAB 67449 


Homo sapiens 


Amino acid sequence of a human 
chaperone polypeptide. 


566 


100 




glJOZ4ZU4 


ocmzosaccnarom 
yces pombe 


anaj protein 


1 ^8 

1 JO 


90 


ijj/ 


gi- 7 / j / / \j i 


thaliana 


tc»f-r-of ri n OT^PTTt'l Hp TPTIPflt "DTOtPin 2- 

LC Li-Cl LI HjUUvUHUV' 1 1 O CtU LfJL \J Ivill ^ 

like 


165 


41 


1338 


gi5 107082 


Arabidopsis 
thaliana 


small zinc finger-like protein 


63 


38 


1338 


gi3879008 


Caenorhabditis 
elegans 


Similarity to C. elegans mut-2 
mutator strain transposon Tc5 
transposase 


85 


22 


1339 


gil4024367 


Mesorhizobium 
loti 


weak similarity to 
ubiquinone/menaquinone 


193 


31 
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Vnn<?vn thesis methvl transferase 






1339 


gi4981950 


Thermotoga 

lllallLlllla 


ubiquinone/ menaquinone 
V>in<3vn thesis metlivltransfera.se- 
related protein 


117 


34 




fTil QA70^7 


JT CUlLUCa 

agglomerans 


BioC 


118 


29 




giroyz.? / 


JT C LI Uo Clllilllll 

crispum 


tvrn^inp-rirH Vivdroxvoroline-rich 
glycoprotein 


97 


25 




rri^QI 771.1 

gijy 1 / / 


f^a^/ici nnrr Millie 


QT»praf1-7 

OlJt'iClVJ- / 


74 


35 


1340 


gi6090929 


Rattus 

I1U1 VC/glL/Ub 


carbonyl reductase isoform I 


49 


37 


1 1A 1 
1.54 1 


gn / 1 uzyz 


riorrio bdpieiia 


Human Hone 23909 rnRNA partial 
cds 


1942 


99 


1341 


AAW80398 


Homo sapiens 


A secreted protein encoded by clone 
cwl543 3. 


1887 


98 






J. .LUlliU i>d|Jlt/lio 


GAP-like protein CN61) rnRNA, 
complete cds. 


298 


36 


1 'XAO 

1 3*tZ, 


A ARS4-77Q 


T-TniYin ^aniftns 

A. 1U11 Ivy Odylwllu 


Human pancreatic cancer antigen 
protein sequence SEQ ID NO:681. 


701 


100 




A AT3^0Rfi7 

/\r\JD JUOU / 




Human zalplia3 1 


701 


100 


1342 


AAW27638 


Homo sapiens 


Secreted protein AP 1 62. 


363 


71 


1 iai 

1 JHD 


rri 1 1^A1Q7Q 


rxuinu odpiciio 


Hone MGO* 13040 

LlUllv 1V1VJ V> UvT^v 

IIVLAGE'3622924 rnRNA complete 
cds 


1212 


100 


1344 


AAY73346 


Homo sapiens 


HTRIV1 clone 619699 protein 
seauence 


1338 


60 


1344 


AAB43912 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1357. 


1337 


60 




m '1 9RHA79 1 


J-XUlllvJ OCljJlCXlD 


clone MGC-2663 IMAGE: 3 543 9 10, 
rnRNA, complete cds. 


1176 


55 


1 


gl 1 J J 1 007J 


T-Tr\TTir» cQm'pnc 


A/fT 7"R rnRNA comnlete cds 


2166 


100 


1 

1 j'+J 


gl Ij Jl O07 / 


lV/fnc mil cph li i q 


MT 7V 


692 


62 


1 lA^ 


rri 1 HAH^^Q 
gl 1 \J^-tD~>DDy 


T-Tr\mr\ ociniPTiC 
jnuiilU aapioiio 


cDNA FLJ12150 fls clone 
MAMMA 1 000422 


255 


37 


1346 


gil2053309 


Homo sapiens 


rnRNA; cDNA DKFZp434I099 
Cfrom clone DKFZp434I099); 
complete cds. 


1162 


99 


1 ^46 




T pi<sViinania 

J 'VlOlHllUlllU 

niaior 

XXXvX 1 X. 


7138 7 


142 


36 


1346 


gi295941 


Ovis aries 


trichohyalin 


115 


22 


1 ^47 


pi1 27^ 1 092 


T-Tnmo ^aniens 


PNAS-123 rnRNA, complete cds. 


306 


98 


1347 


gi9988480 


Ursus maritimus 


luteinizing hormone receptor 


71 


31 


1347 


gi296749 


Macaca 

laoLl^Uiaiia 


acrosin-trypsin inhibitor 


60 


37 


1348 


gil2018147 


Chlamydomonas 
reinnaraui 


vegetative cell wall protein gp 1 


216 


30 


1348 


eil5 145795 


Sus scrofa 


basic proline-rich protein 


194 


34 


1348 


gil0645308 


Leishmania 
major 


L8453.1 


241 


29 


1349 


gil4041991 


Homo sapiens 


cDNA FLJ14457 Us, clone 
HEMBB 10022 17, weakly similar to 
ZINC FINGER PROTEIN 91. 


2187 


100 


1349 


AAB95039 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16797. 


2187 


100 


1349 


gi9963804 


Homo sapiens 


zinc finger protein ZNF286 


1019 


47 
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(ZNF286) rnRNA, complete cds. 






1350 


gi2429118 


Leishmania 
maior 


DDI1;L3169.2 


101 


29 


1350 


gi7243252 


Bombyx mori 


protease-like protein 


78 


27 


1350 


gil4488325 


Oryza sativa 


Putative retroelement 


84 


33 


ljji 


ail 7QCK66 


1VXU.O lllUoUUlUO 




806 


48 


1351 


gil799568 


Homo sapiens 


rnRNA for stac, complete cds. 


796 


46 


19^1 


A AWT^Q&AI 


riorrio bdpxciio 


Amirtn nfirl QPnnPTiPP of ViilTffan Stac 

protein. 


796 


46 


lDJ>Z 


gi4oy 1 1 


ITLOITIO Sdpicnb 


aHf-nt/ln+p ViruiQP S f ATC^ TlVRNA 
dXXCliyXdlC J\JXXd&t/ »J ^iVlvJ y AliLXJ-^^v, 

complete cds. 


384 


100 


UOZ 


a a w /u^H-io 


nUIIlU bdpiCIlb 


T-TnmaTi r1iQf*a<ip rplatpH "nnplprvKflp 

XTxUXXXdil LlloCdoC/ JLCldLC/lX UUvlbU LlUb 

Vina<;p-3 CDRNK-^ nrotein 
sequence. 


384 


100 


1352 


gi89 18488 


Rattus 

• 

IlUi VC^lUU.o 


adenylate kinase isozyme 1 


361 


36 


1353 


AAG01667 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5748. 


62 


36 


1 TCT 
ijjj 


A A VQ £9G9 

AA Y oojyo 


riomo sapiens 


TTnman crt^ne* 9 O—f^n (~* r^fi PT1 "rvrofpin 
XlliXXldll gCXXC Z.v/ - C11L<UU.CU. JJJ.ULt/111 

fragment, SEQ ID NO:308. 


61 

\J X 


42 


1 1 CO 


gizzzy4j 


jtcana caiesoeiana 


A "M U -r\T*i»i^»i it* c r\~r 

j\L\sr pieouibui 


73 


22 


1354 


gil2654383 


Homo sapiens 


cytochrome c oxidase subunit VIb, 
clone MGC:3061 IMAGE:3344701, 


259 


54 


1354 


gil2803321 


Homo sapiens 


cytochi*ome c oxidase subunit VIb, 
rlnne MGr-1 1 53 TMAGF -3347455 
rnRNA, complete cds. 


259 


54 


1 9^4 




T-Tr\mA cjinipnc 

xxuxxxu bdpxciio 


rTnman mPTsJA for nvtonliTOTTiP c 

oxidase subunit VIb (EC 1.9.3.1). 


259 


54 


1 9*^ 


gll UtOt" / 


rioiiiu bcipiciib 


rTYNTA FT T1996Q fis clone 
NT2RP2005841, weakly similar to 

TInmn <2ar>iPn<? mRTsTA for AT,F5C3 


534 


29 


1355 


AAB95263 


Homo sapiens 


Human protein sequence SEQ ID 
NO- 1 7448 


534 


29 


1355 


gil 5029763 


Mus musculus 


Similar to RIKEN cDNA 
1 900004F94 ppne 


316 


29 


1356 


gi57121 


Rattus 

iiux vcgiLua 


ribosomal protein L37 


510 


96 




rriA^I 999 


riUIIHJ bdlJXCllb 


Wnman mP 1\T A firr ribn^inmal nrotein 

XXUillull HlXV-LNXl. 1 VJ1 llUUOUlliai U1UI.V11A 

L37, complete cds. 


510 


96 


1 9^£ 
1 J JO 


m'9Q9Azl1 


nUIIlU bd.pj.CIlo 


T"iV\ncfvm5i1 T^Totpin T ^7 ttiT? TnJ A 

complete cds. 


510 


96 


Ijj / 


m' 1 /199Q£QC 

gu^ozyoyo 


riomo sapiens 


t\ n i o 1 tyiT?TsIA "Fr\r T^i r»n K1pqp v -ma H- 3 
Udl Lldl llilVlN f\ 1U1 J_/\JLiUlC/ot/A""llia.u — ' 

(DM) domain, (DMRTC1 gene). 


326 

•p-' <w w 


100 


1357 


gil4329696 


Homo sapiens 


partial rnRNA for Doublesex-mab-3 
(DM) domain, (DMRTC2 gene). 


323 


47 


1 357 




Sue Qprnfa 


small nroline-rich rjrotein 


73 


32 


1358 


gi4056562 


Homo sapiens 


TNNT1 gene, exons 1-11 (and 
joined CDS). 


1205 


100 


1358 


gi339783 


Homo sapiens 


Human slow skeletal muscle 
troponin T rnRNA, clone Ml. 


1205 


100 


1358 


gi546023 


human, skeletal 
and cardiac 
muscle, rnRNA, 
899 nt]. [Homo 


troponin T slow isoform 
{alternatively spliced} 


1205 


100 
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sapiens 








13z>y 




.rnysarum 
polycephalum 


major pidsniuuiai iiiyuMii iicavy 
chain 


186 


22 

Zw 


1359 


gi553309 


Homo sapiens 


Human (clone SY2/10) golgin-165 
mrviNA, pdnidi oub. 


176 


20 


1359 


AAB69070 


Homo sapiens 


Human male enhanced antigen-2 
NO ? 


176 


20 


1360 


gi 1445 6079 


Drosophila 

UlCIallU 5£,aa LCI 


fucosyltransferase homologue 


709 


38 


1360 


gil3436134 


Homo sapiens 


clone MGC:11141 
IMAGE:3 837094, mRNA, complete 


555 


92 


1360 


gil4456077 


Drosophila 
melanogaster 


alpha 1,3-fucosyltransferase 


283 


33 


1361 


gil4529886 


Mus musculus 


bM14504.1 (novel protein) 


1090 


53 


1361 


gllU43yUOO 


riomo sapiens 


^TYNTA* T7T T99£19 fic rlrvnf* 
CJ_>lN/\. JT l^J ZZO I Z lib, UIUIIC 

HSI04965. 


1 006 


56 
w> \j 


1 *3 £2. 1 

1361 


AAEUz / /y 


Homo sapiens 


Unman T>T?0 f" 1 A/ff^ J.^ nrnfpin 

encoded by DNA-C-MG.45-1776 
cDNA clone. 


1 079 

IV / 7 


47 


1 1 /CO 


glzJzJzo / 


multiple 
sclerosis 
associaieo 
retrovirus 


p o lypro iciii 


^1 09 


82 


1362 


gi535518 


Murine leukemia 

• 

virus 


gag-pol polyprotein 


1292 


39 


1362 


gi331995 


AKV murine 
leuKemia virus 


gag-pol polyprotein (tag amber 

r>r\r\nn at 99^0 99^9 inc^rtQ (Tin in 
LOUOIl ell Z./-J)\j-~Z~Z*J>jL lilbcl lb vjlll ill 

Mo-MuLV) 


1291 


40 


1303 


gn U43yy4z 


riomo sapiens 


HEP11392. 


-J J. 


99 


I303 


rr^ 1 AC\A 1 SQQ 

gll4U4 1 007 


jTiomo sapiens 


rTYNTA FT T14'394 ft<? Hone 

HEMBA1003235, weakly similar to 
TROPOMYOSIN 


1062 


100 


1363 


AAB92554 


Homo sapiens 


Human protein sequence SEQ ID 
NO-1 0741 


1062 


100 


1364 


gi5738559 


Homo sapiens 


mRNA for zinc finger protein, clone 
CZNF41.8, partial. 


324 


64 


1304 


gOO />o /o 


riomo sapiens 


mT27\J A for 7inr fincrpr 41 r77sTR41 

gene). 


324 


64 


1364 


gi5738553 


Homo sapiens 


mRNA for zinc finger protein, clone 
czj!njt4 i ,D } pdruai. 


276 


65 


1365 


gi7020645 


Homo sapiens 


cDNA FLJ20500 fis, clone 
KAT09159. 


274 


35 


1 1 S" c 

1365 


gil2052860 


Homo sapiens 


lTlivINA; CDJNA iJJsJ^ Zvp J 04 ijzu / 1 

Cfrom clone DKFZr>5 6402071 »: 
complete cds. 


974 

Z / H- 


jj 


1365 


AAB51661 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 42 SEQ ID 
NO:101. 


274 


35 


1366 


gi6599235 


Homo sapiens 


mRNA; cDNA DKPZp434E2321 
(from clone DKFZp434E2321); 
partial cds. 


1410 


94 


1366 


gil4029609 


Homo sapiens 


nuclear receptor transcription 


84 


35 
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cofactor fSHARP^ mRNA complete 
cds. 






1 

1-500 


glZU jOoZ 


XvallUo 

norvegicus 


Vipaw npiirofi lament nolvneDtide 


98 


22 


loO / 


giOH-uyo /y 


jnUIHU beipit/llb 


•zinp fincrpr nrotftin 7NF229 

(ZNF229) mRNA, partial cds. 


2299 


100 


1 i&n 




JTLOniU bapiCilb 


pDNA FT T13029 fis clone 
NT2RP3001057 moderately similar 
to ZINC FINGER PROTEIN 9 1 . 


1941 


48 


1 JO / 




JnLUIIlU bapicilb 


T-Tiimfin "nrotftin spniipnce SEO ID 

NO: 17486. 


1941 


48 




'A A C±C\ACSAQ 


XlUlllU ba.piCJ.lo 


TTnman <?prrpfpd rvrotein SEO ID 

NO: 8130. 


490 


90 


1368 


gil4532516 


Arabidopsis 

Irl CI 1 1 Q Tl CI 


AT5g05210/K2All_8 


98 


23 


1368 


gil61292 


Loligo pealei 


neurofilament protein 


101 


21 




gll J 1 5Z ly I 


nuinu octpiciio 


IpnpiTip-ripii-rP'npat "protein RN02 
mRNA, complete cds. 


1353 


97 


1 joy 


gnu i yozuy 


ilUIllU bd.piCl.lb 


raQnnQP rprmitmeTit domain nrotein 

7 mRNA, complete cds. 


911 


40 


i joy 


giz)y i iyj?y 


XlOXIlO bdpiCilo 


mRNA- cDNA DKFZt>58601822 
(from clone DKFZp58601822); 

nartial c,fi<5 


908 


43 


1370 


gil3 183793 


Homo sapiens 


CECR2 protein (CECR2) mRNA, 
complete cds. 


5992 


95 


1370 


gil502355 


Saccharomyces 
• • 

CCiCVlblclC 


GCN5 


248 


33 


1370 


gi9931486 


Mus musculus 


cell proliferation related protein 
CAP 


206 


30 


1371 


gil0800858 


Homo sapiens 


mRNA for aminopeptidase B 
(RNPEP gene). 


3479 


99 


1371 


gii DUozouy 


Jtlonio sapiens 


argiiiyi d.iiiuiupcpLiu.abc 
(aminopeptidase B), clone 
MGC:20408 IMAGE :463 6934, 

ml? ~\J A rninnlptp p/1 Q 
lllivlN^X, OUllipiCLC l^U.o. 


3435 


98 


1371 


gil0933784 


Homo sapiens 


partial mRNA for aminopeptidase B 
(APB gene). 


3431 


98 


13 /z 


1 O C? A/1 A 'I 1 

gllzoU4w31 


nomo sapiens 


cionc ivivjv^. iut-jj 
TM A CtF- ^94 1786 mRNA complete 

cds. 


2355 


100 




A ARAAC^S 
AAD O^r OJO 


nomo bapieiib 


Wnman cpprptpH "nrotPin <;pnilpnce 

jnillllCill bCL/lClCU. pi\JtV/llX QL'VJlAk/Alv/^ 

encoded by gene 41 SEQ ID 
Tvrn- 144 


774 


100 


mi 


gi7332094 


Caenorhabditis 

ClCgallb 


contains similarity to Pfam family 
PF00076 fiRNA recoffnition motif) 
score=82.2, E=le-20, N=l 


500 


47 


1373 


ei6841246 


Homo satriens 


HSPC298 


528 


90 


1373 


gi9758695 


Arabidopsis 
thaliana 


antifungal protein-like 


57 


36 


1373 


gil2006106 


Homo sapiens 


IRA2 mRNA, partial cds, 
alternatively spliced. 


54 


57 


1374 


gi6984160 


Streptococcus 
cristatus 


srpA 


205 


17 


1374 


gi532113 


Caenorhabditis 
elegans 


homeotic region most like 
HMPB DROME: homeotic 


196 


20 
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proboscipedia protein 






1 on/I 

1374 


gno izyz 


LrOligo peaiei 


ncuroiiiaiiiciiL piuicin 


181 


22 


1375 


gil4249827 


Homo sapiens 


clone MGC: 10992 

T1VA A r;"P*^^^7^R7 mP*NTA rnmnlpte 

cds. 


1714 


100 


1375 


gil4602889 


Homo sapiens 


clone MGC: 13 119 
IMAGE:4 100726, rnRNA, complete 

COS. 


1714 


100 


1375 


gil3279188 


Homo sapiens 


clone IMAGE:3626861, rnRNA, 

paXLldl LUb. 


1354 


100 


1376 


gil0437767 


Homo sapiens 


cDNA: FLJ21628 fis, clone 

PAT 0£07fi 


2196 


100 


1376 


gi9502202 


Homo sapiens 


endothelial zinc finger protein 
induced by tumor necrosis factor 
aipna \i2tZ-iV ii j iiicvln^v, L-unipicic 
cds. 


1734 


65 


1376 


gi488555 


Homo sapiens 


Human zinc finger protein ZNF135 
rn±sJ>jy\ ? compieie uus. 


1473 


58 


1377 


gil3543446 


Homo sapiens 


clone IMAGE:3529534, rnRNA, 
partial cds. 


4267 


99 


1377 


gil915977 


Homo sapiens 


Human Uor-l receptor (rivioj gene, 
complete cds, and (SMF) gene, 
partial cds. 


1 QQid. 

1 774 


1 no 


1377 


gi557822 


Saccharomyces 
cerevisiae 


c<+o 1 Ion • 1 7 fJl PAT- n Q 

maiD, stai, len. uo/, l>/vi. u.o, 
AMYH_YEAST P08640 


1 \J\J 




1378 


gil2053173 


Homo sapiens 


rnRNA; cDNA DKFZp434A112 
^irorn cione j_^j.v.r z^pn jh-^v i i^yS 
complete cds. 


3184 


83 


13 /o 


gllU43444j 


riomo sapiens 


pfllNJA FT T17761 fiQ rlnne 
"NFT7RP9001 ^78 weaklv similar to 
MUCIN 2 PRECURSOR. 


3143 


99 


1378 


AAB94287 


Homo sapiens 


Human protein sequence SEQ ID 
NO- 1 4778 


3143 


99 


1379 


gi5821143 


Homo sapiens 


rnRNA for RNA binding protein, 

L7 CIA LA CIA V^LAo, WlUXlu. J-Vvy^ . 


156 


27 




gioo^yz^fz 


T-T/ - vmf\ OOtMPMO 

xiomo bapieiio 


(SRM300) rnRNA, complete cds. 


156 


27 


13 /y 


gllo /34 


oiycine max 


J_^1N/A.-QUCULCL1 XvlN s\ puiyillClabC 


153 


32 


1380 


gi847724 


Homo sapiens 


Human methylthioadenosine 

pilOSpilOiyiabe ^lVll/\JTy I1JJVLN^\, 

complete cds. 


1457 


95 


1380 


gii ioUz3yz 


xiomo sapiens 


metnyimioaaenobiiic piiubpiiuiyidoc 
(MTAP) rnRNA, complete cds. 


1454 


95 


13oU 


'111 i 1 A1/Z 

gll3 1114/0 


jvms museums 


mexnyiimoaaenosme pnotjpiioiyiabc 


1370 


90 


1381 


gil2584839 


Homo sapiens 


HT036-ISO (HT036-ISO) rnRNA, 
comolete cds. 


826 


94 


1381 


gil2584841 


Homo sapiens 


HT036 (HT036) rnRNA, complete 
cds. 


826 


94 


1381 


AAG03594 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7675. 


398 


88 


1382 


gil2584839 


Homo sapiens 


HT036-ISO (HT036-ISO) rnRNA, 
complete cds. 


931 


91 


1382 


gil2584841 


Homo sapiens 


HT036 (HT036) rnRNA, complete 
cds. 


754 


89 
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1382 

X mmJ \J A—t 


eil2543184 


Corvnebacterium 
glutamicum 


RXA01089 


312 


43 


1383 

X w> V_J w> 


2il2584839 


Homo sapiens 


HT036-ISO (HT036-ISO) rnRNA, 
complete cds. 


916 


99 


1383 


eil2584841 


Homo sapiens 


HT036 (HT036) rnRNA, complete 
cds. 


739 


99 


1383 


eil46120 


Escherichia coli 


glyoxylate-induced protein 


324 


39 


1384 


gi 10440340 


Homo sapiens 


cDNA: FLJ23598 fis, clone 
LNG15381. 


330 


82 


1384 


gil0434023 


Homo sapiens 


cDNA FLJ12500 fis, clone 
NT2RM2001675. 


292 


53 


1384 


AAB94072 


Homo sapiens 


Human protein seauence SEO ID 
NO: 14260. 


292 


53 


1385 


AAB95751 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18660. 


155 


36 


1385 

X *S \-J m** 


AAY 16782 

A A..A A. A. A / W*^ 


Homo sapiens 


Human secreted protein (clone 
cg426 8). 


141 


54 


1385 


gil2053181 


Homo sapiens 


rnRNA; cDNA DKFZp434F142 
(from clone DKFZp434F142); 
complete cds. 


99 


35 


1386 


AAB92536 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10701. 


2682 


100 


1386 


AAW29657 


Homo sapiens 


Human secreted protein BP101_2 
(alternative). 


1615 


100 


1386 


AAW29650 


Homo sapiens 


Human secreted protein BP 101 2 . 


1579 


100 


1387 


2i6103649 


Homo sapiens 

X. X KJ X X XKJ kJUpiwllU 


F-box protein FBX10 rnRNA, 

_1_ 1 . ^ A. Ih* A A.AA A> ,A, ^ A A. A> V* A AAA. «A » A AV« 

partial cds. 


2878 


99 


1387 


2i6164739 


Homo saoiens 

X X V*/ X a X\^/ UViL/ aVXIU 


F-box protein FbxlO (FBX10) 

J, • ■ V i A. L/A bVAAl -A »^ «lt A % A. A^ A AV A V«» # 

rnRNA, partial cds. 


963 


100 


1387 

A 1 VwJ / 


AAY83080 

x 12 l i u •w-' vy \j \j 


Homo sapiens 

A. A. V_<* A A IV/ k_J \A- lw* 1V11U 


F-box protein FBP-12. 


963 


100 


1388 


gil5082426 


Homo sapiens 


Similar to RIKEN cDNA 
2810055F11 gene, clone 
MGC:20203 IMAGE :4 6 84 6 87, 
rnRNA, complete cds. 


1840 


99 


1388 


gil3435795 


Mus musculus 


Similar to RIKEN cDNA 
2810055F11 gene 


1661 


88 


1388 


gi9695312 


Trypanosoma 
cruzi 


B-cell mitogen precursor 


483 


32 


1389 


gil017722 


Homo sapiens 


Human repressor transcriptional 
factor (ZNF85) rnRNA, complete 
cds. 


2077 


73 


1389 


2il86774 


Homo s aniens 


Human Kruopel related zinc finger 

X X LU.1A1411 XaX lA 1_S WA a. A W* 1» W \A MAAAV A. AAA^m a 

protein (HTF10) rnRNA, complete 
cds 


2068 


67 


1389 


gi6088100 


Homo sapiens 


rnRNA for zinc finger protein 
CZFD25^ complete cds 

\ XJX J — aW w' / 2 vv/JLiip XV Iv WUJ • 


2029 


68 


1390 


gil0434854 


Homo sapiens 


cDNA FLJ 13031 fis, clone 
NT2RP3001 113, weakly similar to 
INTRACELLULAR PROTEIN 
TRANSPORT PROTEIN USOl . 


3194 


100 


1390 


AAB95279 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17488. 


3194 


100 


1390 


gi3044185 


Plasmodium 
falciparum 


mature parasite-infected erythrocyte 
surface antigen 


171 


18 


1391 


gi6561965 


Arabidopsis 


kinesin-like protein 


636 


48 
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t\~i 1 1 ?> n a 

lllclllcllxCl 








1391 


gil2862607 


Mus musculus 


kinesin superfamily protein 1 9A 


664 


94 




rrn99'*Q9il9 
glZZ. 


oUIllZiUod.UL'Ilal Ulll 
vppq ■nombp 


IririPQiTi-liVf* TYrotein 

Jvill&dlll ~11X\\S JL/XVJIX/X1X 


644 


49 


1392 


gil2053319 


Homo sapiens 


rriRNA; cDNA DKFZp5 86104 18 
(from clone DKFZt>5 86104 18): 
complete cds. 


1011 


99 


1392 


gil0436265 


Homo sapiens 


cDNA FLJ13954 fis, clone 
V79AA 1001 145 


401 


98 


1392 


AAB94842 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16016. 


401 


98 


1393 


gi 13442786 


Mus musculus 


Drctnnbla 


773 


70 


1393 


gl 13442 /o4 


riomo sapiens 


ITli.Vl>/\ JLOr LJL\\~/ X IN IN XD I /A., L/U1X1JJXCIC 

cds. 


752 


66 


1393 


gizo2Z4oo 


xoecinopsis 
occidentalis 
occidentalis 


iviijLVw/ cidbb ix pro LC 1X1 


65 


33 


1394 


gllZODiziD 


Homo sapiens 


euKaryoiic uansiaiioii liiiiiaLiuii 

-fcmir^r *X cuVvi mit" ^ ( or^TTTmfl 40kTj 1 
IdUlUl D , bUU Hill I J) ^gcUHlUCl, ^ VylvJU/ ^, 

Hone MGC8431 TMAGE'2821 133 
mRNA comnlete cds. 


1 773 


95 


1394 


gi2351380 


Homo sapiens 


translation initiation factor eIF3 p40 
subunit mRNA comnlete cds. 


1773 


95 


1394 


gi3986482 


Homo sapiens 


translation initiation factor eIF3 p40 
subunit eene. exon 8 and complete 
cds. 


1769 


95 


X w? -7 w> 


ffi3 169261 


lVfus musculus 


T-box transcrmtion factor 


1584 


98 


1395 


gi3 128382 


Mus musculus 


MmTbxl4 


1398 


98 


1395 


gi 12082748 


Mus musculus 


T-box transcription factor TBX18 


970 


87 


1 

i jyo 


nri* 1 flA^ ^ 1 nn 


1-1 AIYin OCttMPtlO 

rionio oapiciio 


rTYNTA FT Tl ^1 R9 fis clone 
NT2RP3004253. 


702 


99 




J\J\r>yjDO 1 


JlOniO bdpiCUa 


JTLLUllClll Ul \J tt^lll Ot^LJ^Ll^ll^y^/ OjuvV^ 

NO- 17667 


702 


99 


1396 


gi6601502 


Arabidopsis 

■fli a 1 1 q n 51 
lllclllaiici 


arginine/serine-rich protein 


104 


30 


i^y / 


&i*1 9£*\^ 1 £S 


XxUllllJ oclLJlCllo 


7111^ firtcypr "nTntpiTi ^56 clone 

MGC:1413 IMAGE:3 13861 1, 
mR7\TA comnlete cds 


2656 


100 


1397 


AAZ55698_ 

da 1 


Homo sapiens 


cDNA encoding human zinc finger 
nrotein BMZF3 


2647 


99 


1397 


gi4894364 


Homo sapiens 


zinc finger protein 3 


2646 


99 


1398 


gi458692 


Homo sapiens 


Human (H326) mRNA, complete 

LfiXo. 


2060 


67 


1398 


gi200241 


Mus musculus 


protein PC326 


1597 


59 


1398 


AAB57007 


Homo sapiens 


Human prostate cancer antigen 

piULClIl oCL[L1C11L/C tJJJjK^ ±LJ 1NW. 1 O ~; . 


763 


74 


1 399 


H6467391 


Homo satuens 


TONDU (TONDU) mRNA, 
complete cds. 


158 


66 


1399 


gil2652601 


Homo sapiens 


TONDU, clone MGQ2032 
IMAGE:3504527, mRNA, complete 
cds. 


158 


66 


1399 


gil3097189 


Homo sapiens 


TONDU, clone MGQ5266 
IMAGE:2900293, mRNA, complete 
cds. 


158 


66 


1400 


gil4335444 


Homo sapiens 


axonemal dynein heavy chain 8 


4347 


70 
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(DNAH8) iriRNA, complete cds. 






1 A f\C\ 

J.4UU 


^ri1 AIIZAAAl 

gl!4o 5 3440 


ivlus mus cuius 


axonemai aynem neavy cnain o long 
form 


4^47 


70 


1 A f\f\ 

1400 


glDD / / 10 


oniamyaomonas 
reinhardtii 


gamma neavy cnam suuunit oi 
outer-arm dynein 


9R07 


47 


1 A C\ 1 


ir-i 1 A 1 OCQIQ 

gll4ly5j3j 


riomo sapiens 


riuosomai proxem i^iz, cioiic 
MGC:9760 IMAGE:3855674, 


7S4 




1401 


gil 86800 


Homo sapiens 


Human ribosomal protein LI 2 
mRNA, complete cds. 


754 


92 


1 /in 1 

14U 1 


AAUrUoo 1U 


nomo sapiens 


XxUIIXall dCUICLCU. piUtCJ.ll, OXjV,^ LjLJ 

NO: 7891. 


7S4 




1 AC\1 


gl /UZUoo 1 


nomo sapiens 


KAT02633. 


107 


25 


1402 


AAY35946 


Homo sapiens 


Extended human secreted protein 
sequence, oxiv^ ij-' inw. 


99 


24 


1402 


gi6467974 


Dendrobium 
grex Madame 
i nong-m 


MADS box protein DOMADS2 


83 


27 


1403 


gil 5079625 


Homo sapiens 


clone IMAGE:3957606, mRNA, 
partial cds. 


519 


36 


1403 


gllz /4o33z 


Homo sapiens 


UJNA cytosine metnyitransrerase d 
alpha (DNMT3A) mRNA, complete 
cds. 


1 A1 


11 


1403 


AAY540D / 


Homo sapiens 


Amino acid, sequence oi novo uin a 
cytosine methyltransferase 

TYMTV/TT"* A 
UiNIVl 1 J A. 


1 A1 




1404 


gil3938351 


Homo sapiens 


Similar to zinc finger protein 268, 
clone IMAGE:3352268, mRNA, 
paniai cos. 


3615 


99 


1404 


gil 86774 


Homo sapiens 


Human Kruppel related zinc finger 
protein ^xiiir lvj) rriiN.iN/\, ounipicLc 
cds. 


2590 


55 


1 ACiA 
14U4 


rri 1 CiAACMQSl 


rionio sapiens 


rnxviNA ior rLJuuujz prutcm, paiLitii 
cds. 




S7 


1 Af\< 


gllZUDoU 1 j 


xiomo sapiens 


iritviNA, cj-/iN/\ i.v'js^r z^pH-jH-j u i u 

^11 Dili U1U11U l^XVJ/ZJjJT-JT-J V/ 1 l*JJ) 

complete cds. 




1 00 




cti 1 747 


VJ1 y L/ tUldg Lib 

cuniculus 


tl lOl iUliy dlJULJ. 


149 

x r s 


19 


1 An 4 ; 

1H-UJ 


triOQI £ 

giyy 1 O 


x lasmoQiurn 
falciparum 


liver stage antigen 


146 


22 






xiomo sapiens 


J. lie allllllU aUlLl SCCj[UCllL/C KJl tllC 

AS 152 1 protein. 


1 77 


100 

X \J\J 


LHKJO 


AA W^tliOO 


riomo sapiens 


oecreiea protein /\o i jz i . 


177 


100 


1406 


gil 57622 


Drosophila 

in 1 a n c\ <y £i c "f* 3 t 
iiicidiiu^cto ici 


hairless protein 


65 


45 


1407 


gil0435212 


Homo sapiens 


cDNA FLJ 13262 fis, clone 
OVARC 10009 12. 


1020 


100 


1407 


AAB 94541 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15286. 


1020 


100 


1407 


gil 2044832 


Chlamydomonas 
reinhardtii 


DEAH-box RNA helicase 


132 


34 


140S 


gil0834636 


Homo sapiens 


double FYVE-containing protein 1 
mRNA, complete cds. 


1874 


100 
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cril 346QR01 

gllj'rOi'OUl 


xHJlllU od IJ JLt/lJLo 


tandem FYVE finders- 1 orotein 
mRNA, complete cds. 


1874 


100 


1 4HR 


oil 4049 04R 


JLAUlllU otxJJldlo 


cDNA FLJ 14493 fis clone 
MAMMA1002972. 


1760 


100 


1 40Q 


cril 9^^000^ 


Rnmn Qanipn*? 

L JLU111VJ ocljjxv^xlo 


te<?ti<;-«;necific nrotein TSP-NY 
mRNA, complete cds. 


2586 


100 


1 4HQ 




IxVJIIlU bdpiCllb 


TT <?anien<; CFNP-E mRNA 


149 

X 1 *S 


26 


1409 


gil53727 


Streptococcus sp. 
group vj 


M protein 


138 


23 


1 4 1 n 

141U 


gllZOJO / uj 


X1UI11U bdJJlCllb 


clone MGP-3048 TMAGE-3343305 
mRNA comnlete cds 


1182 

X X \J ^* 


100 


1410 


gi476061 


Saccharomyces 
cerevisiae 


YBR0834 


169 


40 


1 41 C\ 
1^1 \j 


cTi^9^7^ni 

gl->Zj? / J5U1 


cerevisiae 


Virl94n 


169 


40 


1 41 1 
111 1 




T-Ti iman 
1 1 innaii 

herpesvirus 8 


latent nuclear antigen 


232 


22 


14 1 1 

1*T 1 1 


glZ-„7 J J'T 1 1 


\_/ V lo aii^j 


Irichohvalin 

Li. IVllUli. Jf IX X XXX 


183 


24 


1411 


gi2 12449 


Gallus gallus 


nonmuscle myosin heavy chain 


182 


24 


1412 


gil2053319 


Homo sapiens 


mRNA; cDNA DKPZp586I0418 

ffrnm Hone DTCF7n586T041 8V 

I 11 Hill U'lvllC IVivl Zj^JOUIuTJIO^) 

complete cds. 


1195 


100 


14 IZ 


gl 1 U^jOZDJ 


XxUlIlU 1 bapiCllb 


cTYMA FT Tl ^954 fk clone 
Y79AA 100 1145 


406 


100 


1412 


AAB94842 


Homo sapiens. 


Human protein sequence SEQ ID 
NO:16016. 


406 


100 


141^ 


oil ^nan^R 

gll DkJO\J 3 DO 


XxUlllU odpiCIlb 


Similar to T OP8R74S clone 

Oil 1111 4X1 L\J X_/vvV w ^OCj / " w< j UlUilL/ 

MGC-20327 IMAGE'43003 13 
mRNA comolete cds 


1291 

X *w _X X 


98 


1413 


gil3623395 


Homo sapiens 


clone IMAGE:4053965 J mRNA, 

nartial cH<? 


1265 


98 


1413 


gi6841156 


Homo sapiens 


HSPC253 


912 


99 


1414 
l^t 14 


oil ^H7QQ71 
gii ju /yy 1 1 


1J.UII1U bajpiciio 


Similar tr» TJ'AST Vina<5P plntiP 
Ollllxicii lvj x rxu x ivxx i cx o 3 viuiiw 

MGC: 19784 IMAGE:3831196, 
mRT^JA coTTinlete cds 


2275 


92 


1414. 


cti 1 00665° 


Wnmn Qanipns; 

XXVjllIU OClLfXt/llO 


FT «? aniens mRNA for FAST kinase 

JL JL • UtiL/ivlXu JL Jt. JLJL x^_L i J. X 1U1 JL J. jLI^J' X 1V111UUW • 


2275 


92 


1414 


AAR72830 


Homo sapiens 


Human TIABP2. 


2275 


92 


1415 


gil2654589 


Homo sapiens 


clone MGC:2550 IivlAGE:2966683 5 
mRNA, complete cds. 


1252 


100 


1415 


AAB73513 


Homo sapiens 


Human transferase HTFS-20, SEQ 
ID NO:20. 


1252 


100 


1 A 1 < 


gl4zUU440 


Jvius museums 


r i \ sz linger- containing 

pUObpilOUlUblLlUC IVlllabC 


905 




1416 


gil79331 


Homo sapiens 


B94 protein mRNA, complete cds. 


403 


27 


141o 


gll ico 1 /4 


Kattus 
norvegicus 


similar xo yeasi oecop, owiss-jr iui 

A Ar>pcomn "NXn-mVtf*** T^^9S44* cimilaT* 
x\CUCbblOIl rNUHlDCi x jZOfr, bllllllcll 

to mammalian B94, Swiss-Prot 
Accession Number Q03 1 69; 
Method: conceptual translation 
supplied by author 


246 


94 

• 


1416 


AAY51115 


Homo sapiens 


Human HSEC6 protein. 


229 


24 


1417 


g i 104404 16 


Homo sapiens 


mRNA for FLJ00043 protein, partial 
cds. 


6737 


95 


1417 


gil3604143 


Mus musculus 


tangerin A 


1969 


50 


1417 


gil3604145 


Mus musculus 


tangerin B 


1155 


79 
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1418 


gi4589375 


Homo sapiens 


mRNA for Gab2, complete cds. 


438 


43 


1418 


gi4589377 


Mus musculus 


Gab2 


428 


52 


1418 


gi41 59804 


Mus musculus 


PH domain containing adaptor 

mo 1 ppi ll p» T"tQ7/frnh9 


428 


52 


1419 


gi4589375 


Homo sapiens 


mRNA for Gab2, complete cds. 


438 


43 


1 41 Q 


crt4 1 ^0SO4 


IVIUS 1T1U.S CUIUS 


i n CLUlIldlll CUlllaJtlxlllg <\\X<Xyj\\Jx 

molecule p97/Gab2 


498 


■J z< 


1 41 o 


tri4^RQ^77 

glHJOZO / / 


IVlUb ITlUbt/UlUb 


VJdUZ, 


428 


52 


1 490 


oil 7S9°£R4 


JTlOxlxU bapitJIlb 


Olillllal IU llUUoUxxld.1 ^/xUlCxXX \j\J 

Vina<?p ^91cT~) "nnlvnpntiHe 1 done 

MGCM1287 IMAGE-3945902 
mRNA, complete cds. 


27^7 


100 

x v/ w 


1490 




l-Tomo ^anipn^s 


Novel nrotein kinase SEO IE) NO* 

J. ^ V/ V wX L/X \J wv lll IVlllU O w ^ V—? JL-/ IX/ X ^ V/ * 

138. 


2065 


100 


1490 




xxuxxxu oajL/x^xio 


ribnQomal nrotein kinase mRNA 

XXl_/\JOVJXXXC4X UU Ul U Lvlll IVilltiOW Jl JLXLVL 1 1 X, 

complete cds. 


533 


55 


J. HZ- X 


tn'9^000^ 


frnllllQ CrflllllQ 

\JaiXUo tiCUXlXo 


pin fa mine rieli nrn tein 

C^XUICXXXXXXXV XXvil yjLVJ l\^L-LL 


151 

X w' X 


26 


1 49 1 


cn91 1 8Q6 


f"r£i11nQ ffjallno 

V_J ClXXLXo li CIXX lio 


li-pn IHpQTnnn 


145 


24 


1 491 

J. *TZ» J- 




firJi11n<3 trn11n<; 

VJCIXXUO (~y CI 1 1 LI o 


p.alfipsmon 

\s (X 1 Uv O X XXVJ IX 


145 

x r w' 


24 


1422 


gi 12654493 


Homo sapiens 


clone MGC:2718 IMAGE:2821816, 
mRNA, complete cds. 


1819 


100 


1422 


AAY 10843 


Homo sapiens 


Amino acid sequence of a human 

5CLICLCU IJILJLCHX. 


84 


36 


1422 


gi330135 


human 

llCiUCo ViiUo 1 


latency-related protein 2 


83 


43 


149^ 


cri9^S1 900 


N/1 n c mncpnlnc 
1VJ.U.O xlXUovllXU-o 


T<f i anticrpTi 

XVI dXXLXt^^/Xl 


1284 


100 


1423 


gi3219288 


Mus musculus 


PA28 gamma subunit 


1284 


100 


1423 


gil2655139 


Homo sapiens 


Similar to proteaseome (prosome, 

Illat/lUpcllll^ Z,o bULJUlxll, J , CHJI1C 

MGC:1394 IMAGE:3 139753, 

mT?~MA mmnlete en's 

XXXXVX > xx, V^VJXXlLMVw' Lv tUO . 


1284 


100 


1424 


gil0176999 


Axabidopsis 

LXiCtXXaXXCt 


pectin acetylesterase 


170 


25 


1424 


gil431629 


Vigna radiata 


pectinacetylesterase precursor 


174 


24 


1494 


cri^0470S9 
glJU^+ / uoz 


/\xa D1Q Op bib 

thaliana 


blllllla.1 LU V lglld laLilctlct 

pectinacetylesterase precursor 


171 

X / 1 




1425 


gil817526 


Anthocidaris 

Ciabblbpilia 


intermediate chain 1 


412 


58 


1425 


gi7580490 


Homo sapiens 


NM23-H8 (NME8) mRNA, 
complete cds. 


480 


40 


1/19^ 
14ZJ 


glo^UoLoj 


uanio reno 


nucieosiae uipnospnaie Kinase 
NDPK-Z6 


997 




1426 


gil3097804 


Homo sapiens 


clone MGC:4368 IMAGE :2 822930, 
mRNA, complete cds. 


845 


92 


1426 


gil327ooU4 


rr „ • _ 

Homo sapiens 


Clone 1VICjL,:z /43 !MACjxi:2ozZyoU, 

lliXVIN -TV, L/L>1X1|JXCIC v^vXo. 






1426 


gi6580722 


Yersinia 

pseudotuberculos 
is 


O-unit polymerase-like protein 


86 


26 


1427 


gi3880445 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF02214 (K+ channel 
tetramerisation domain), 
Score=79.5, E-value=2.3e-20, N=l 


194 


51 


1427 


AAY34129 


Homo sapiens 


Human potassium channel 


201 


43 
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K+Hnov28. 






1427 


AAB952U1 


Homo sapiens 


rluman protein sequence oxiv^/ ii^ 
NO: 17295. 


901 
ZU 1 


47 


142o 


gl930o8ol 


Homo sapiens 


mKiNA; CUfN/A. J^xsJrZvP j4 / UUOj 

(from clone DKFZp547D065). 


1 SQR 


00 


1428 


AAE00674 


Homo sapiens 


Human protein tyrosine kinase 
receptor (PTK) from clone 

TJ C 1 C TO /I C. 


1302 


100 


1428 


gi6453590 


Homo sapiens 


mRNA; cDNA DKFZp434F2322 
(rrom clone iJJsJ H Ap434rZ3zz > ;. 


1040 


60 


1429 


AAB92801 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11309. 


1306 


88 


1 /I 1 A 

1429 


gil2803785 


Homo sapiens 


oimilar to cell cycle progression z 
protein, clone MGC:3620 

TA/f A dV •7/ : i79nn ,c \ mPTsJA rntimlpfp 
llVLr\VJI1.303ZUUw), IIlXviN UOIlipiClC 

cds. 


9741 


1 on 

1UU 


1 a on 
14zy 


A A DO/1AQ 1 

AAr>y4Uo 1 


riomo sapiens 


numan protein sequence 011^ lr^ 


9774 


00 


1 47n 
143U 


A AT7A1 H71 


nomo sapiens 


rTUlIld.Il LLC a 111 U-Ulllalll-L-iJIlLclllillig 
-r-*ar»J a t*\'fr\f ^T^T^r^T?^ TM*r»t'< = *iTi ■fi*r»Tn 

HFIHQ20 clone. 


1 959 

1Z1J7 


1 00 


1 47 A 

143U 


rri 1 4HQ1 Q<\9 


IvdllUb 

norvegicus 




396 
^7 \j 


39 


1 47H 


rril 1791 47^ 
gll 13Z l^f-jJ 


JtvdCLUS 

norvegicus 


aLUS*y L 111 lCpCdL - !!^!! lllClllL/ldllC- 

spanning protein 


^94 

J 7T 


39 


1 47 1 
It- 3 1 


rril 1 87^7^4 
gll 1 o / D / OH- 


xiomo sdpieiib 


I1C UliJgC 11111 gCllC, pdiLldl L/Llo. 


1 ^01 

i. -J \J X 


100 


1431 


gil 504095 


Mus musculus 


DNA-hinding protein 


1153 


83 


143 1 


giioooy iu 


ivius museums 


neurogenin z 


1 1 4S 

1 1H-J 


R7 


1432 


gi5230714 


Pseudonaja 
textnis 


long neurotoxin precursor 


78 


34 


143Z 


A A T>A Q/^7 


riomo sapiens 


numan oJdv_,d protein sequence oxiv^ 
ID 10. 


so 


S9 


1 A 71 

143Z 


gllOZoDUl 


v esicuiar 
stomatitis virus 


pnospnoprotem 


uo 




1/177 
143 3 


gll oo 1 /3o 


/\.can tiiamo e u a 
castellanii 


myosm-i umamg protcm /A.udiii^j 


701 
oy 1 


99 


1 /111 
14 33 


rri 1 47A1 fc/^A 
gllH /VJlOOO 


u i c ty o s te n um 

U.lbl»/tJlU.CUlll 


Udlllill 




29 


1 477 


ail 7607K4S 


1VJLU.O IIlUoL/Ultlo 


i*iVtrkrmr , 1f a nQf a /nncTioo'PTii'n inhihltOT 2 


112 


28 

x— lj 


1434 


gil2584159 


Homo sapiens 


zinc finger protein 268 (ZNF268) 

lllXVl > L< Wllljpic LC 


5261 


100 


1434 


gil4579579 


Homo sapiens 


ZNF268B mRNA, complete cds, 
aitemdiiveiy bpiit/CQ. 


4395 


100 


1434 


AAB93278 


Homo sapiens 


Human protein sequence SEQ ID 

"MTV 1 9799 


4006 


98 


1435 


gil3094679 


Mus musculus 


rihosome receptor isoform mRRp47 


245 


26 


1 47S 


ail 7004677 


A/Til o miicnilnc 
IVlUb IllUoOUlUo 


rilin<!ninp rpppntnr l^nfotm TnRRlDol 


234 


24 


1435 


gil61958 


Trypanosoma 
cruzi 


surface antigen 


227 


21 


1436 


gi987652 


Escherichia coli 


hypothtycal protein in UNG 3' 
region 


129 


26 


1436 


gil 063 8 


Trypanosoma 
cruzi 


ribosomal protein P-JL5 


76 


43 


1436 


gi436146 


Trypanosoma 
cruzi 


TcP2beta 


76 


43 



WO 03/029271 



PCT/US02/30474 



182 
Table 2A 



SEQ 

X.XJ 


Hit ID 


Species 


Description 


S 

score 


Percent 
identity 


1437 


gil2653161 


Homo sapiens 


ribosomal protein L35, clone 
MGC:8582 IMAGE: 29609 87, 
mRNA. complete cds. 


311 


75 


1437 


gi562074 


Homo sapiens 


Human ribosomal protein L35 
mRNA comnlete cds. 


311 


75 


1 A'xn 




TToTTIO C51T>1<=*T1Q 
I1U11XU OClUivXJlO 


TVreast and ovarian cancer associated 
antigen protein sequence SEQ ID 
799. 


311 


75 






iiunivj octiJiC'ii.o 


Arn?A nrotein Comdex subunit 

ild U^/ L/l V U Will WVAAiL/lv/k. V# V* V 

p21-Arc (ARC21) mRNA, complete 
cds 


457 


96 


1438 


AAW29658 


Homo sapiens 


Homo sapiens BH272_3 clone 
secreted nrotein. 


457 


96 


1 O 


ai2209347 


Homo saoiens 


p21-Arc mRNA, complete cds. 


457 


96 


1439 


gil4042550 


Homo sapiens 


cDNA FLJ 14779 fis, clone 
NT2RP4000398 moderately similar 
to ZINC FINGER PROTEIN 140. 


1428 


65 


1439 


AAB93164 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12091. 


1428 


65 


1439 


gi456269 


Mus musculus 
domesticus 


zinc finger protein 30 


1349 


53 


1440 


eil322220 


Homo sapiens 


Nmi mRNA, complete cds. 


91 


32 


1440 


AAY06223 


Homo sapiens 


Apoptin-associating Nmi/Hou-like 
protein. 


91 


32 


1440 


AAW73472 


Homo sapiens 


Human cancer-related protein Nmi. 


91 


32 


1441 


jri9294050 


Arahidonsis 

thaliana 


nrotein kinase-like protein 

L/X Xi/ Will 1V111MU V HAfc^^ ^— ' V * * 


113 


30 


1441 

IHt 1 1 


tril 9R0409S 

£,1 1Z,OWtV/ZiJ 




clone MGO 11242 
IMAGE:3939938, mRNA, complete 
cds. 


100 


38 


1441 


gil914851 


lVfus musculus 


WW domain bindine protein 5 : 
WBP5 


62 


58 


1442 


gi48556 


Stenotrophomon 
as maltophilia 


chorionic gonadotropin receptor 
homologue 


63 


38 


1442 


?i442514 


S ten n tronh o mon 

as maltophilia 


CG-like protein 


63 


38 


1442 


gi9758931 


Arabidopsis 

tTi 1 i ^ n 3 
Lllcllicixici 


receptor protein kinase-like protein 


60 


28 




A AY70847 




Human NIP 2b nrotein 


739 


43 


1443 


gil2803291 


Homo sapiens 


BCL2/adeno virus E1B 19kD- 
interacting nrotein 2 clone 
MGC:1529 IMAGE:3343559, 
mRNA. complete cds. 


718 


46 


1443 


gi558844 


Hnmn <>£ir)iens 


BCL2/adenovims E1B 19kD- 
interacting protein 2 (BNIP2) 
mRNA, complete cds. 


718 


46 


1444 


gi2429362 


Santalum album 


proline rich protein 


150 


34 


1444 


gi6456717 


chayote mosaic 
tymovirus 


overlapping protein 


145 


28 


1444 


gil 043 8473 


Homo sapiens 


cDNA: FLJ22184 fis, clone 
HRC00983. 


141 


27 


1445 


gi6911211 


Mus musculus 


ATP -binding cassette protein 


268 


71 


1445 


AAB95548 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18171. 


97 


40 


1445 


gi3806158 


Caenorhabditis 


similar to the ATP-binding transport 


101 


39 
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P.lfiPTlTi*; 


nrotein familv ( ABC transporter) 
(Pfam: ABC_tran.hmm, score: 
162.68 and 139.46) 






1446 


gi633990 


Mus musculus 


zinc finger protein 


1970 


86 




gi l l \J L Zs £*\J 


lVTus musculus 


Ozrf 1 nrotein 

^ * A — I A- A A mm* A Xp-^ V AAA 


1967 


86 


1446 


gil336158 


Rattus 
norveeicus 


pancreas only zinc finger protein 


1960 


85 


1447 


gil3186114 


Homo sapiens 


mRNA for rab interacting lysosomal 
nrotein (RILP gene). 


1577 


94 


1447 


gil3436353 


Homo sapiens 


clone IMAGE:3632533, mRNA, 
nartial cds. 


1312 


93 


1447 


gi7549210 


Babesia 
bigemina 


200 kDa antigen p200 


156 


30 


1 AA9. 


cri 1 ^ 1 1 14 
gllJ iOUl It 




mRNA for rab interacting; lysosomal 
protein (RILP gene). 


2047 


99 


1 AAR 


gllJtJUJJJ 


TTnmn ^arnpns 


clone IMAGE:3632533, mRNA, 

VXVlAV A>4t ▼ 1 J A> * m* * • AW W w 2 A AAA. ^ * ^ A- -A- J 

partial cds. 


1782 


100 


1 AAR 


cri7S4.09 1 0 


bigemina 


200 kDa antieen n200 


152 


27 


1 AAQ 




XXU1X1U oCipiV/ilO 


Human {?ene 22 encoded secreted 
protein HOFNM53, SEQ ID 
NO* 105 


320 


67 


1449 


AAY36238 


Homo sapiens 


Human secreted protein encoded by 
gene 15 


115 


53 


1449 


AAB24023 


Homo sapiens 


Human PRO 14 10 protein sequence 
SEQ ID NO:9. 


115 


53 


1450 


gi 12002226 


Homo sapiens 


C3HC4-type zinc finger protein 
(LZK1) mRNA, complete cds. 


2042 


99 


1 A^C) 

ItJu 


L^l 1 KJ'-Xj) 1 A.Z>\J 


X X \J1L1\J OCllJ L\s LIO 


cDNA- FLJ21230 fis clone 

V/ J ^ A ™A A* A. Mm4%J AW A. AW J> A W J WA.X^A.A%P^ 

COL00741. 


2042 


99 


1 asd 

ItJu 


rri 1 17S??7? 
1 J / Z*Z. / Z. 




larvneeal carcinoma related nrotein 

W. LVX V 114- V V4> a W *^*-X- JLAXV a A aV*» a- »* A %r^* ^ aaa 

1 mRNA, complete cds. 


1504 


99 


1451 


gil3752754 


Homo sapiens 


zinc finger 1111 mRNA, complete 
cds 


1951 


62 


1451 


gil0436789 


Homo sapiens 


cDNA FLJ 14345 fis, clone 
THYRO1001 189, weakly similar to 
ZINC FINGER PROTEIN 9 1 . 


1873 


60 


1451 


AAB95862 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18929. 


1873 


60 


"IAS? 


til *J Z* KJjL. -J J> \J 




mRNA- cDNA DKFZp586G1822 
(from clone DKFZp586G1822); 
partial cds. 


72 


35 


1 A^l 


A AT^S^QVl 
AADJJ7 / D 


Xxuiiilj adpicua 


TTnman polon rannftT autiffen nrotem 

sequence SEQ ID NO: 15 13. 


67 


31 


1452 


gil 334643 


Xenopus laevis 


APEG precursor protein 


76 


34 


1 /I CI 

14;>3 




riomo sapiens 


xxU.IIld.Il DlCaol Ca.llL'd aobUtlaicu 

antigen protein sequence SEQ ID 
NO:613. 


964 


99 


1453 


AAB63250 


Homo sapiens 


Human breast cancer associated 
antigen protein sequence SEQ ID 
NO:6l2. 


415 


74 


1453 


AAB63249 


Homo sapiens 


Human breast cancer associated 
antigen protein sequence SEQ ID 
NO:61l. 


381 


81 


1454 


gi7239109 


Homo sapiens 


HSPC059 


2117 


63 
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S 

atui c 


Percent 

iHpntii'V 


1454 


gil3752754 


Homo sapiens 


zinc finger 1111 mRNA, complete 

UUb. 


2012 


64 


1454 


gi 186774 


Homo sapiens 


Human Kruppel related zinc finger 
protein (HTF10) mRNA, complete 
cus. 


2004 


62 


1455 


gil4042046 


Homo sapiens 


cDNA FLJ14492 fis, clone 

T\/f AA/TN/f A 1 0090^7 wt&cAAv cir-rtilar 

ivi/\iviiviAiuuzy d /, wcdKiy similar 
to ZINC FINGER PROTEIN 135. 


262 


28 




A A VlQ^ 1 
s\J\n>y D 1 VJd 


xiomo sapiens 


riuman proiem sequence ony iu 
"KTO* 17076 


9^9 
zoz 


98 
Zo 


1455 


gil4042143 


Homo sapiens 


cDNA FLJ 14549 fis, clone 
NT2RM2001670, weakly similar to 
7TNC FTNGFR PROTFTN 29 


253 


32 


1456 


gil5159614 


Agrobacterium 

ti i Tn p fa p i p n q 


AGRJL_2275p 


191 


37 


1456 


gil 5076209 


Sinorhizobium 

Tnplilrii"! 


HYPOTHETICAL PROTEIN 


187 


34 


1456 


gi7271117 


Cicer arietinum 


25.7 kDa protein 


135 


36 


1457 


gil4250716 


Homo sapiens 


clone MGC:13310 
IMAGE:4110431, mRNA, complete 


1267 


43 


1457 


AAB21018 


Homo sapiens 


Human nucleic acid-binding protein, 
NuABP-22. 


1095 


49 


1 4^7 


A AR9 1 Oid.9 


xiomo sapiens 


nuniaii nucieic acia-Dinanig proiem, 
NuABP-46. 


/^9 

ooz 




1458 


gil 110452 


Homo sapiens 


Human mRNA for hCREM (cyclic 
/\ivijr -responsive element 
modulator) typel alpha protein, 
complete cas. 


594 


99 


1458 


gi30217 

• 


Homo sapiens 


H.sapiens mRNA for cAMP 
responsivc-eieineiiL niuQuiaior 
(CREM) (partial). 


589 


98 


1 zKR 


ai771 7991 
gl / / 1 / zz I 


xiomo bd-piciio 


c/vivix responsive eiemeni 
modulator (CREM), exon G and 

[Jul Hal tUo. 






1459 


AAB54325 


Homo sapiens 


Human pancreatic cancer antigen 

nrotpin <;pfiiipnrp SFO TD lsJO*777 


751 


98 


1459 


gil4993576 


Dictyostelium 

H i <?p ni H pi l m 


deliriumA 


226 


29 


1459 


gil4488149 


Homo sapiens 


Authorin submission 
'FNTR YN A MF ' 

J->IN X XV. X l>(.rtJ.VXX-r . 


195 


33 


1460 


gil0436256 


Homo sapiens 


cDNA FLJ13949 fis, clone 
V79A A 1001 041 


2706 


100 


1460 


AAB94838 


Homo sapiens 


Human protein sequence SEQ ID 
NO-1 6008 


2706 


100 


1460 


AAB67573 


Homo sapiens 


Amino acid sequence of a human 
hvdrolvtic enzvme HVEN5^5 


2701 


99 


1461 


AAG03605 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7686. 


299 


98 


1461 


gil 5077806 


Danio rerio 


zinc finger buttonhead-related 
transcription factor 1 


55 


39 


1461 


AAY27742 


Homo sapiens 


Human secreted protein encoded by 
gene No. 36. 


56 


34 


1462 


gil 1595643 


Neurospora 
crassa 


probable abcl protein precursor 


870 


47 
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ID 


±11 1 ID 


species 


uescripuoii 


c 

score 


Pprrpn t 

identity 


1 A £0 

146Z 


glJUo 115 1 


Arauidopsis 
thaliana 


Axsoi proiem 


OOu 


46 


1 A 

146Z 


nrl 1 < 1 A £LA 1 

gll j 14641 


ocnrzosaccnarom 
yces pombe 


aociop 




4S 


1 A 

146:5 


gli4J 360 Jy 


xiomo sapiens 


lopij.j sequence secuuii i ui o. 


1470 

1 *T / \J 


QQ 

ZsZs 


1 A £S1 

1463 


gloj /ZD44 


xiomo sapiens 


j-meinyi-aQenme urN/\ giycuayid.be 

i*nT?"NJA pntnii 1 p^p 1 f Ho 
lllTviN jtv j t/UlIipiCLC LUo. 






1463 


AAB56919 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1497. 


1459 


98 


1464 


AArSDoy iy 


xiomo sapiens 


xiuman prosiaxe cancer aniigcn 
protein sequence SEQ ID NO: 1497. 


1 OJ 1 


yy 


1 A iZA 

1464 


glo j /Zj44 


xiomo sapiens 


j-metnyi-aaenme uina giycosyiase 
mRNA, complete cds. 


1 JjZ 




1464 


gll43366 /y 


Homo sapiens 


lop 13. 3 sequence section i oi o. 




1 on 


1 A £1 £. 

1466 


~j 1 /n o'-7 1 ^7 /C 

gil47o /l /6 


Mus musculus 


CoMJJl 


OA1 1 
D\)L 1 


09 

yz 


1466 


gil4787181 


Homo sapiens 


CUB and sushi multiple domains 
protein i snort iorm ibkina, 
complete cds, alternatively spliced. 


3046 


95 


1466 


nil AHC\AT^ 

gli4 /y4 /Z6 


xiomo sapiens 


i^ujt5 ana susni multiple aomains i 
protein mRNA, complete cds. 




yD 


1 A /C7 

146 / 


a a pm 170 
AAtjrUZl /o 


Homo sapiens 


xiuman secreted protein, £>Jd^ iu 


ZoZ 


o 


1467 


gi470322 


Daucus carota 


proline-rich protein 


108 


31 


1 a tin 
146 / 


rt M A1 flO 1 OO 

giiuiyoioz 


dadrastis 
kentukea 


Jd IN L^UZ 


1 n^s 

1 UJ 


9R 
Zo 


146o 


A A PA1 /I <T 


xiomo sapiens 


xiuman secretea protein, oxiv^ iu 
NO: 5538. 


J /Z 


1 no 


146o 


cr\H (Will 


xiomo sapiens 


cui>/\ rL^JA\JLDD us, cione 
COL06818. 


1 QA 




1 AAQ 
140o 


gll JZU7jDj 


uVenopus laevis 


Kiiieoiri-iiKc piuieiii 


1 fsl 
ID/ 




1469 


gi4033608 


Anthocidaris 
ciassispma 


B2HC 


1476 


42 


1469 


gil0710 


Tripneustes 
gratnia 


Beta heavy chain of outer-arm 
axonemai aynem a i .rase 


1189 


35 


1469 


gi2 17203 


Anthocidaris 
ciassispma 


dynein beta-heavy chain 


1182 


34 


1 A Hf\ 

14 /U 


gioZjuu^y 


Anopneles 
gambiae 


serme protease-iiice piotein 


Zho 


JO 


14 /U 


1 CA 1110/1 

gll julzlz4 


Mus musculus 


oimiiar to distal mtestmai serine 
protease 


9A1 


4 / 


14 /U 


glj / / /33U 


Homo sapiens 


esp-i mKJNA tor eosmopnii serme 
protease, complete cds. 


9A1 
Z^+l 




1 /I ~7 1 

14/1 


giyy j / /6U 


■1 I Hill ^-w ,-i ■ -•■ _ ■ J yr-m 

xiomo sapiens 


lv_Liv 1 j (JsJ^Jvijj gene, complete 
cds, alternatively spliced. 


1 ^97 
loZ / 


QQ 

yy 


1 A "7 1 
14/1 


gll44o4yZZ 


Homo sapiens 


prostmogen mKJNA, complete cos. 




QQ 

yy 


1471 


gi 11244759 


Homo sapiens 


serine protease gene cluster, 

CUIlipiCLC ocquciiL-c 


1230 


94 


1472 


gil0435369 


Homo sapiens 


cDNA FLJ13373 fis, clone 
PLACE 1000749. 


2500 


99 


1472 


AAB94615 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15462. 


2500 


99 


1472 


AAY51846 


Homo sapiens 


Human 18.1 homolog protein 
fragment. 


1382 


100 


1473 


gil4043862 


Homo sapiens 


clone MGC: 14138 

IMAGE:3948518, mRNA, complete 


1000 


100 
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XXJ 


Hit ID 


Species 
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S 

score 


Percent 
identity 








cds. 






1473 


gil2053377 

* 


Homo sapiens 


rriRNA; cDNA DKFZp586C1924 
(from clone DKFZp586C1924); 

pnmTilptp f*Hs 


1000 


100 


1473 


AAG03905 


Homo sapiens 


Human secreted protein, SEQ ID 
NO- 7986 


545 


100 


1474 


AAB47134 


Homo sapiens 


CDIFF-15, Incyte ID No. 
^478571 CD1 


1886 


51 


1474 


gi7022185 


Homo sapiens 


cDNA FLJ 10260 fis, clone 
HEMBB 1000973, moderately 
similar to Mus musculus schlafen3 
mRNA 


943 


39 


1474 


AAB92636 


Homo sapiens 


Human protein sequence SEQ ID 
NO* 10951 


943 


39 


1 47S 


A AR471 ^4 


Homo cQT^ipfTc 

XXUXIXW OClLylwilJ 


CDIFF-15 IncvtelDNo 
3478571CD1. 


1280 


63 


147S 

It 1 / J 


cri70?91 R5 


TTriTTIO cnTiipTT? 


cDNA FLT10260 fis clone 
HEMBB 1000973, moderately 
similar to Mus musculus schlafen3 
mRNA. 


600 

V./ \J \J 


38 


147S 




TTnrnr* Qfinipn^ 

XXUllXvJ duUlvllO 


Unman irrotein seauence SEO ID 
NO: 10951. 


600 


38 


1476 


tH6708478 


lVTiis rnnspiiliis 


formin-like nrotein 

1V/1 x xxxxx iiivv r" ^ l w xxx 


509 


65 


1476 


gi41 01720 


Mus musculus 


lymphocyte specific formin related 
protein 


466 


71 


1 476 
14 / o 


ail ^44746R 


A Gi"\*=»T*0"i 1 1 n c 
.r\.opci glillxo 

ninnlfins 

XXLUlti.C4X.LO 


T7T-T1 /Th I— T"7 rvrntfMn Ti c\ in o 1 o o 


141 


25 


1477 


AAG02288 


Homo sapiens 


Human secreted protein, SEQ ID 
NO- 6369 


561 


100 


1 477 


pi 1 495971 


A ra hi H nnsi s 

thaliana 


RNA hehcase 


290 


58 


1477 


cn'9?9^9^S 


AraHiHrvnsis 

thaliana 


nrR-mRNA stjlicin*? factor ATP- 
dependent RNA helicase-like 
protein 


290 


58 


1478 


AAY45013 


Homo sapiens 


Protein encoded by fchd531 gene. 


443 


40 


1478 


AAT94467 
aal 


Hnmo saniens 


Human Fchd53 1 eene differentially 
regulated in endothelial cells. 


443 


40 


1478 


gi9886891 


Mus musculus 


zinc finger protein 276 C2H2 type 


469 


35 


1479 


ai6^1 8855 


TTfimn sanipns 

± Jl\J L11\J OC*lJJ-wlli3 


A -kinase anchoring trrotein 220 
mRNA, complete cds. 


323 


23 


1 470 


ai 1 ? 1 64 1 1 


IVClLLUb 

norvegicus 


AKAP 220 


229 


23 


1479 


gi8885520 


Streptococcus 
gorQoiui 


streptococcal hemagglutinin 


200 


20 


1480 


gil4042307 


Homo sapiens 


cDNA FLJ 14644 fis, clone 

NT9PP9001 756 wpaklv similar to 
i>) i ^Xvr/.uui / ju, vv cciivi^y oiiixxxcix l\j 

ZINC FINGER PROTEIN 84. 


1130 


100 


1480 


AAB95176 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17237. 


1130 


100 


1480 


AAB58718 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 
426. 


452 


98 


1481 


AAG0043 1 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4512. 


232 


80 


1481 


gill 120624 


Homo sapiens 


cationic trypsinogen (TRYP1) gene, 


39 


50 
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TTit TT> 

111 L 1L/ 




Descrintion 


s 

score 


Percent 
identity 








TRYP1-L104P allele partial cds. 






1481 


gi31863 


Homo sapiens 


H.sapiens gene for myeloperoxidase. 


58 


37 


1 /ISO 


rri 1 A"K 1 7RQ0 


f-ici 1 l"j 1 o rrdllnc 
VwJcHlUo gdlxUis 


OjJlllUXlll 


280 


65 


1482 


gil2052957 


Homo sapiens 


rnRNA; cDNA DKFZp566G0346 
(from clone DKFZp566G0346); 

p Atrrnl pf p fT?c 
L/UlllplCIC l/Uo. 


280 


65 


1482 


gil4317888 


Gallus gallus 


spindlin 


276 


65 


1 A Q1 


gi^y /uyo4 


nomo sapiens 


PAP plnnp "RP1 9 1 SP1 S from 1 ? 
r/\Ks Clone j\ri-zijnj nuiii jqiz,, 

complete sequence. 


1 


95 


1 /I Q1 


gl^Ooo 111 


nomo sapiens 


rl uione vjor xxum JL[iz, 

complete sequence. 




95 


14oJ 


gl!3 14:540 


nomo sapiens 


xiuman survivaJ. moioi ncuiuii 
(SMN) gene, exons 7 and 8, and 
complete cds. 






1 A O /I 


A A Z" 1 AO A A1 


Homo sapiens 


Jriuman secreiea proiem, onv^ llj 
NO: 6524. 




1 00 

1 \J\J 


1484 


AA Y D\)yZ 3 


nomo sapiens 


nUIIlall iclal Did 111 {sLJ±\f\ C'HJliC' 

vclO 1 derived protein. 


6Q 


43 


1 /I Q/t 

14o4 


giio /yooo 


nomo sapiens 


riunidn didxm-z rcidLcu. pxuLciii 
niRNA, partial cds. 






14oj 


gi3o i / 1 y^t 




lllXvlN /A j L/JLvl > ^V. i^JNJ. Zjp'TJ'Tl Ul 1 

(from clone DKFZp434F01 1); 
partial cds. 


408 


100 


1 /t 


A A VOHOQQ 
AA i ZxJZyo 


nomo sapiens 


JTLUIIldll dpUlipupiULClll JZ< IlltlldllL 
■nrntpin -pra crinRTit" 1 1 

UlULt^lll lldgjlllCllL J. J. . 


1 04 


38 


1485 


gi2130539 


Homo sapiens 


gallbladder mucin MUC5B rnRNA, 

r\c>'H'i5il r*rlc 
pal Licii i/Uo, 


109 


33 


1486 


AAW19919 


Homo sapiens 


Human Ksr-1 (kinase suppressor of 

Xvab ). 


294 


81 


1486 


gill71250 


Mus muscnlus 


protein kinase related to Raf protein 

JvlIldbCo, IVICULIUU.. 1/Ulll/CJJlUa.l 
11 Cllio 1CI L1W11 o UllbU J \,Ll\J X. 


279 


78 


1486 


gil322382 


Cyprinus carpio 


ZP2 


175 


43 


1 A9.1 
Ho / 


gllU^t J7DJU 


XlUUlU bctpiCIlb 


rD"NTA* FT T?^05Q fi«; Hone 

LNG03912. 


2810 


99 


1 A9.1 
Ho / 




nomo sapiens 


TTumQn c/=»r*T , p»i"f='H - r>Tr»'ff*lTl SsF TT~) 
nuilldll aCWCLCU Lyxutciiij ojjy u_y 

NO- 71 70 


306 


100 

X \J\J 


1487 


gi4322670 


Homo sapiens 


dentin phosphoryn rnRNA, complete 

vUj. 


116 


18 


HOO 


oi'0dS441 6 


l\/Ti i c mncpiiliio 
IVXUb HlU.oULllU.fj 


vinp "Finorpr "nrotpin SnS 


2122 


97 


1488 


gil5077806 


Danio rerio 


zinc finger buttonhead-related 

Liciiiov^i ipiiuii laviui x 


744 


60 


1488 


gi466466 


Rattus 

• 

•jt nnfp cri nit c 
11UI VCglV-'Ua 


zinc finger protein 


565 


55 


1489 


gi4323152 


Mus musculus 


Ets-protein Spi-C 


880 


67 


1 A&Q 
may 


glo /HOH-lH- 


A nl i-v-fi /"\ porn 

/vuiono udi a 
hansbaenschi 


Opi U UdUoL/liptlv^ii ia^LUi 


476 


49 


1489 


gil 1245502 


Raja eglanteria 


SpiC 


449 


42 


1490 


AAG01095 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5176. 


264 


70 


1490 


gi7644350 


Homo sapiens 


golgi matrix protein GM130 
(GOLGA2) rnRNA, complete cds. 


195 


69 


1490 


gi8099669 


Homo sapiens 


golgin-like protein (GLP) rnRNA, 
complete cds. 


178 


66 


1491 


gil2053013 


Homo sapiens 


rnRNA; cDNA DKFZp434J01 13 


125 


41 
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/'from rlnne DKF7n434J01 13V 

complete cds. 






1 A Q1 


gizyooo i 


lo s triQium. 
thermocellum 


Q_1a\/pr nrrttpm 


156 


31 


1492 


gi347377 


Homo sapiens 


Human MLL-AF4 der(l 1) fusion 
nmtpin mRTsTA cornolete cds 


106 


22 


1492 


gi3822238 


Lymantria dispar 

nucleopolyhedro 

• 

virub 


mucin-like protein 


114 


25 


1493 


gi6683745 


Homo sapiens 


HOTTL protein mRNA, complete 


252 


26 


1493 


gi5419859 


Homo sapiens 


mRNA; cDNA DKFZp434B103 
f from Hone DKFZn434B103V 

l 11 VJJ.11 UlUllC J-^XNJ. -£_/L7T^~TX-» X V7«-» y 5 

partial cds. 


256 


27 






nomo bdpieiib 


T-lrpaQt" flnH ovarian cancer associated 

antigen protein sequence SEQ ID 
617 


256 


27 


1494 


gil4043433 


Homo sapiens 


clone IMAGE:3952677, mRNA, 

■nartial pHq 

JJcULLLCll L/lJo. 


1371 


100 


1494 


gi7023601 

* 


Homo sapiens 


cDNA FLJ1 1 127 fis, clone 
PT APF100699S 


590 


41 


1 ACkA 




nomo bapiciis 


Human rvrrvfpin QPfiiiPtlfft SJFO 1 D 

NO-12861 


590 


41 

■ Jb 


1495 


gi7022422 


Homo sapiens 


cDNA FLJ 10408 fis, clone 
NT2RM4000585 


1039 


76 


1495 


AAB92790 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11286. 


1039 


76 


1495 


gil5156995 


Agrobacterium 

ItllllClCl^lCllo 


AGR_C_3445p 


125 


27 


1 


gio JO / 1 zo 


IV/Tnc mncpnlnQ 

IVxUo JxlUol/U-lLlo 


mDi6 


992 


83 






TTnmn c c>-r\i p~n <5 

J. XUlllVJ StlUXulia 


mRNA for TVTRT comnlete cds. 


771 


62 


1496 


gi6681594 


Homo sapiens 


HSJ2 mRNA for DnaJ homolog, 

rnmnl pf p md 

V^lJIlllJl^' LV^ VIXO . 


771 


62 


1497 


gi8777659 


Homo sapiens 


transcription factor HOXD12 
( HOXD 1 2^ and transcrintion factor 
HOXD 11 (HOXD 11) genes, 
complete cds. 


1262 


96 


1407 


piR7149R 


IVTtis mil senilis 


Hox-4.6 


1237 


93 


1497 


gi397509 


Mus musculus 


HOXD- 11 


1173 


93 


14Z70 


rriQQI ^ 
giyy x O 


"pi n omnHinm 
JT IdoIIlU U-l U.111 

falciparum 


1i vpr Qtacre anticen 
iivci oia.^^ ciiiLi^v_'ii 


148 


20 




m"7^2lQ9 1 H, 


bigemina 


900 kDa antiaen n900 

Jvxvct dllUlgCll J^J^VJW 


130 


21 


1 A QQ 


giZioy4i 


vjvis aries 


■h*ir»V*r\ViT/s* 1 1 ti 
11 lOllUll y alxx JL 


108 


20 


1499 


gil2803855 


Homo sapiens 


clone IMAGE:3616574, mRNA, 

pdl Llctl lUo. 


1095 


100 


1499 


ei6539606 


Homo s aniens 


metastasis suppressor protein 
mRNA, complete cds. 


857 


81 


1499 


gil203820 


Cricetinae gen. 
sp. 


insulin receptor tyrosine kinase 53 
kDa substrate 


130 


24 


1500 


gil2324510 


Arabidopsis 
thaliana 


AIG1 -like protein; 11637-17773 


195 


28 


1500 


gil2324505 


Arabidopsis 
thaliana 


AIGl-like protein; 41 133-42535 


194 


27 


1500 


gi!2324503 


Arabidopsis 


AIGl-like protein; 37301-39301 


166 


29 
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S 
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thaliana 








1 SOI 


ai7^ 84990 


norvegicus 


linmpnH Atnflin nrnteiTi 1\LX 

XXUllXt'UU.VjXXXCXXXX J~ LV^i-AA V 


263 


48 

1 x— ' 


1 ^ni 

J. D\J 1 


ail 9746971 
gll ^ / fOZ / 1 


XvctllUb 

norvegicus 


iClXXicXX XXUXXXGUUvJ A LXClXXQ^/XXLyLXtJXX 

factor Rx/rax 


263 


48 


1501 


gi2240024 


Mus mus cuius 


retinal homeobox protein 


274 


37 


1 cm 




Aqunex aeoncus 


cnromosome assemuiy proiein 
homolog 




94 


1 jUZ 


glZ / 04OJb 


iviy cop lasma 
hominis 


v ad sunace lipoprotein ciuncbiii 
precursor 


/U 


99 






.r yrococcus 
abyssi 


onroiiioisOiiic ocgiegaixoxx pxoicxxi 

(smcl) 


Q7 


21 

Z> X 




ai^969^R9 


jnOIIXO aopiCXla 


(from clone DKFZp434K063); 

nartial r,H<? 


3782 


99 

_/ / 


1503 


AAE01527 


Homo sapiens 


Human gene 15 encoded secreted 
nrotein fragment SFO TD NO'l 84 


3358 


100 


1503 


gil0438230 


Homo sapiens 


cDNA: FLJ21993 fis, clone 


1416 


100 


1504 


g i 10434483 


Homo sapiens 


cDNA FLJ12788 fis, clone 
NT2RP2001946. 


1492 


99 




A A"RQ*i1 87 


xiomo sapiens 


riUlllall piOLClll oCLJ^UCllUC Ol3V^ xJLs 

NO:17252. 


14Q9 

X *T/Z» 


99 




A A f^ni 401 
AAUU 1 ^7 1 


rxomo sapiens 


n.UIlld.11 bCL-IClCU. piOLClll, kjJC/V^ xxJ 

NO: 5572. 


759 


QQ 

/ / 




gn u^fr-?y*+oD 


riomo sapiens 


cj_/1Njtv. r j-ij z.z.y'-t'-t iia, cioiic 
KAT08974. 


X / OJ 


go 

/ / 


l dud 




riomo sapiens 


ITlxvl N /\ , CUrN /\ LJjSJt ^jpHJjHxKJ / o 

(from clone DKFZp434P078); 

prMnnlp'f'p fnc 
V^UIiipiCLC LUd. 


1 JOU 


1 00 


1505 


gi5532389 


Lytechinus 

a 7 a T"i or ci c 
V dl ICgaLUo 


microtubule-associated protein 


96 


24 


1506 


gil0435385 


Homo sapiens 


cDNA FLJ13385 fis, clone 
PLACE1001088. 


1040 


100 


1 ^n^ 


A A RQ^AOQ 


no mo sapiens 


rxUlllctll piOLClll bCHUCllv^C OC^/ j.x^/ 

NO:17793. 


1 040 

X \j'-r\j 


100 

X \J\J 


1 ^06 


glOiJ^f ZU 1 Z) 


On/7d GQ"t"1A/Cl 

V^iyZid. bd-LlVa 


CALCllblil-llxvC IJHJI1I1C~1I^XJL LIlOLdlX 

RiP-5 


60 


24 


1 507 


cn6094684 


T— 1 Vvmo c^iniptiQ 

XXUX11VJ OCIL'ICIIO 


PAP clone RP1-278D1 from X 

X Ix.K*' LlUllt XVX. 1 / OX/ X XX v/XXX ^ v , 

comnlete seauence 


984 


34 


1507 


gi7023516 


Homo sapiens 


cDNA FLJ1 1078 fis, clone 
PT.ACF 1 00 S 1 02 weaklv similar to 

X Xjxvv/XjX VV J X v4<j WwllAiy o XXXXXXCIX Lv 

RING CANAL PROTEIN. 


963 


35 


1 507 


^VZAJ_> _7 ._> *T O V/ 


X_LUxxxv_i odJJlCllc) 


TTnman Tifntpin Qpnnpnrp SIF.O IF) 

XXIXXXXCIXX L/XVJtt/XXX obU LX^/XXv/G OXjV^ XJ_/ 

NO-12768 


963 
/ \j ~j 


35 


1508 


eil4781 88 

&x x. r / o x o o 


Oa11n<? callus 


CHoxE 


597 


75 


1508 


gi6594621 


Danio rerio 


homeobox protein 


314 


57 


1508 


gi7689275 


Xenopus laevis 


homeodomain protein dbx 


310 


50 


1509 


AAG023 66 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6447. 


391 


98 


1509 


gi9965254 


Bos taurus 


RPGR-interacting protein 1 isoform 
b 


107 


42 


1509 


gi9966409 


Bos taurus 


RPGR-interacting protein- 1 


107 


42 


1510 


gi496639 


Mus musculus 


transcription factor Ap-2 beta 


1044 


77 


1510 


gi3309577 


Gallus gallus 


transcription factor AP-2 beta 


1039 


77 
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ID 


Jtilt ILf 


Species 


.Description 


c 

score 


jr ercent 
identity 


1 f 1 A 

ID 1U 


gl!4yD4 1 / 


Homo sapiens 


xl. sapiens niKJNA tor Ar-z Deta 
transcription factor. 


1 nia 


in 


1^11 
ID 1 1 


rri 1 A/f2/^7A*v 

gl 1 U4D0 /<JD 


riomo sapiens 


_T\xt A T7T T1 /IOQA fie ^l^no 

CD IN A rJ-J 14ZoU lis, clone 
PLACE 1005 5 84, weakly similar to 

1 xvrviN o-/\V»> 1 UN VJf 

TRANSCRIPTIONAL PROTEIN 

TPPO 


i4zy 


1 AA 

1UU 


1511 


AAB95825 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18834. 


1429 


100 


1^11 


glOU ID^KfO 


xiyio Dates 
pileatus 


aop amine receptor U4 


1 1Q 

lz>y 


'X'X 


1 CI T 

ID 1Z 


rr* 1 A/1 "3QQ79 

giiu^oyy /z 


nomo sapiens 


cjjjna. ri^jzjoZD lis, clone 

JlIJDJl 1 ZDUD . 


DZ06 


OO 


1512 


gil0438214 


Homo sapiens 


cDNA: FLJ21982 fis, clone 
rlxix^UOi oo. 


2838 


99 


1512 


AAB75305 


Homo sapiens 


Human secreted protein sequence 
encoded oy gene id orii^ id 


2785 


98 


1513 


gil 1275984 


Homo sapiens 


ubiquitin-like fusion protein mRNA, 

LUI11U1CLC t^Llb. 


3802 


99 


1513 


gi2 14866 


Xenopus laevis 


ubiquitin-like fusion protein 


1098 


46 






XvdlLUi) 

norvegicus 




70 1 




1 S 14 


cril S9 1 1*549 

1JZ 1 JJtZ 


jnuiiiu adpiCllb 


XFQD 1 /'MQnn ml?NA r-r»mr^1^f^ 

cds. 


1 1 R9^ 

1 1 OZj 


07 


1514 


gi3329465 


Mus musculus 


NSD 1 protein 


9872 


82 


1514 


gil 043 8794 


Homo sapiens 


cDNA: FLJ22413 fis, clone 

JnJx^UOH- / D . 


5795 


100 


1515 


gil0436342 


Homo sapiens 


cDNA FLJ14001 fis, clone 

V7Q A A 1 009905? 


1404 


99 


1515 


AAB94881 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16098. 


1404 


99 


ID ID 


gi/uzziyo 


Homo sapiens 


CDJNA rLJ lOZo/ ris, clone 
HEMBB1001056, weakly similar to 

DB HT TT7"DT? A TTMVl r^TTT T 

rKULlr JblvA 1 UN 0-L,xii^L 

NUCLEOLAR ANTIGEN PI 20. 


1 An 
1UU 


30 


ID lO 


glD JOU1Z3 


riomo sapiens 


jn i -KJiiN-D / antigen rriKiNA, partial 
cds. 


/o 


dD 


1 J lO 


rri^R AQ 1 OA 

gluoUo 1 o4 


nomo sapiens 


mKJNA, CUINA IJlVrZiP^DH-^Ul lo 

(from clone DKFZp434C01 18); 
partial cos. 


/o 


3D 


1516 


AAY92341 


Homo sapiens 


Human cancer associated antigen 
precursor irom cione in i -xvJC/In-j /. 


76 


35 


1517 


gil0434661 


Homo sapiens 


cDNA FLJ 12903 fis, clone 

"NJT9T? P9 (\(\A*i &A 


840 


99 


1517 


AAB94367 


Homo sapiens 


Human protein sequence SEQ ID 
NO- 14901 


840 


99 


1517 


gi4322670 


Homo sapiens 


dentin phosphoryn mRNA, complete 
cds. 


123 


21 


1518 


gi6807686 


Homo sapiens 


mRNA; cDNA DKFZp434D0513 
(from clone DKFZp434D05 13). 


5133 


100 


1518 


gil0439597 


Homo sapiens 


cDNA: FLJ23035 fis, clone 
LNG02033. 


1475 


100 


1518 


gil 0434608 


Homo sapiens 


cDNA FLJ12871 fis, clone 
NT2RP2003751. 


107 


30 
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SEQ 

ID 


Hit ID 


Species 


Description 


S 


Percent 

identitv 


1 C1Q 

I J Ly 


rri 1 HAIOQ^I 


xiomo Sapiens 


r-TYMA • FT Mil 1 S fi<; Hone 
C1J1NA. rLJZjjlJ lib, uxuxxc 

HEP 12021. 


3908 


100 


1 <C 1 Q 

1 j ly 


rri 1 ClAIA 1 0*7 
gl xUHO^- 1Z / 


no mo sapiens 


CJL/lN/\ r J_fJ IZjOj Ho, UlUXXC 

NT2RM4000848. 


1? 1 7 


99 


ID Ly 


A A"RQid1 11 
J\J\t>yH 1 J j 


nomo sdpienb 


JTLUIxialx piUlClll bCL[U.CIlt/C; uljy XX-/ 

NO: 14393. 


3217 


99 


1 <;if\ 
1 jzU 


rri 1 n/i/i n*i*7*7 
gllU44UJ / / 


nomo sapiens 


mPTsJA fnr "RT 100094 nrntpin nartial 
rillvl N r\ xUl rLiJUUUZ'T pxuicill, paillal 

cds. 




71 




rri 1 O ^ ^*7H9 

gllZDZ) /uz 


i^icer arieununi 


-mf^+Q 1 1 r\f"1iir\ Tiffin 
11 lc LallU LIxlUllClxl 


J o 


66 


1521 


gil3592175 


Leishmania 
major 


PPg3 


145 


24 


1521 


gi5420389 


Leishmania 

ixiaj Ul 


proteophosphoglycan 


127 


25 


1^91 
IjZl 


A A VCM&A1 


nuiiiu bapxcxib 


TTnm?m pYirappllnlar matrix and ppII 

jnLU.Hla.ll CA.LL ClL-CllLllCll 111CIL11W U.11U. 

adhesion molecule-7 TXMAjD-7^ 


113 


25 


1522 


gil3177727 


Homo sapiens 


clone MGC:4606 IMAGE:3356565, 

mT?TsJA f*r\TTTn1 pfp f*d^ 


1293 


96 


1522 


AAY73335 


Homo sapiens 


HTRM clone 1850120 protein 

bCLJ U.C'llwC . 


1288 


95 


1522 


gil817526 


Anthocidaris 
crassispina 


intermediate chain 1 


142 


33 


1523 


gi529662 


Homo sapiens 


H.sapiens SCA1 mRNA for ataxia. 


440 


53 




AA Y3J4y4 


riomo sapiens 


riuman o^/\i proiem. 


Ado 


jj 


1523 


AAR71111 


Homo sapiens 


Spinocerebellar ataxia type 1 (SCA 
i ) piotem proauci. 


436 


50 


1 CI/1 

1dz4 


AALrU4UOD 


nomo sapiens 


riuman secrete a protein, ojdv^ iu 

1NU. O J.H / . 


*+ 1H- 


07 

y / 


1 ^04 
1 J/H 


rriO*7r^nr^ZLQ 

giz / ouo^y 


jLeucama 
separata iiuuicai 

polyhedrosis 

vivi 1 Q 

V 11 L4.o 


VPRf) 
V Jr OU 






1524 


gi2367597 


Drosophila 

X1XV/ XCIXX\J tifclij LU1 


aryl hydrocarbon receptor nuclear 
Iran slocator-li Ice rjrotein 


61 


32 


1525 


gil2659148 


Mus musculus 


mage-e2 


656 


88 


1 s?^ 


£f,l L jL\J J) Z7 X'-r\J 


IVIUO 111U.OV IXX U.O 




478 


43 


1525 


gil2659142 


Mus musculus 


mage-gl 


401 


39 


1 ^96 


glH- D ~) OZuJ 


Zica mays 


L<y lUL/iiiuiiic uj lcuLiotabc 


280 


31 

—> X 


1526 


gil4536592 


Physcomitrella 
paieiis 


PP001069030R 


277 


31 


1526 


gil4536588 


Physcomitrella 
parens 


25_ppprotl_046 eOl 


270 


36 


1527 


gil0439267 


Homo sapiens 


cDNA: FLJ22757 As, clone 


4111 


100 


1527 


gil0437166 


Homo sapiens 


cDNA: FLJ21 129 fis, clone 


919 


51 


1527 


gil0438855 


Homo sapiens 


cDNA: FLJ22457 fis, clone 
HRC09925 


306 


26 


1528 


gil0439267 


Homo sapiens 


cDNA: FLJ22757 fis, clone 
KAIA0803. 


4060 


99 


1528 


gil0437166 


Homo sapiens 


cDNA: FLJ21 129 fis, clone 
CAS06266. 


921 


51 


1528 


gil0438855 


Homo sapiens 


cDNA: FLJ22457 fis, clone 
HRC09925. 


306 


26 


1529 


gi7673675 


Drosophila 
melanogaster 


cactin 


1041 


45 
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ID 
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S 

score 


Percent 
identity 


1529 


gi4204304 


Arabidopsis 
thaliana 


lcl|prt_seq No definition line found 


622 


31 


1529 


AAY07071 


Homo sapiens 


Renal cancer associated antigen 
precursor sequence. 


192 


97 


1530 


gi3 00477 


human, 
endogenous 
retroviral 
element Kl VL- 
Hp3, Genomic 
Mutant, 240 nt, 
segment l oi zj. 
[xiomo sapiens 


pol=reverse transcriptase homolog 
{retroviral element} 


305 


75 


1530 


gi2323287 


multiple 
sclerosis 
associated 
rerrovirus 


polyprotein 


244 


44 


1530 


gi6723273 


Baboon 
cikjo geno us 

virnc ch"!^ in 1VT7 

V 11 LIS O Ll dill IV J. / 


gag-pol precursor polyprotein 


236 


46 


I J D 1 


g 1 0 D *+ \J Zs \J Z, 


lV/Tnc lYiiicfnlnc 

1VJ.U.O lllllol/ UlUo 


1 A 1 ^ rrmtf^in 

U1UIC111 


1990 


97 


1531 


gi7023188 


Homo sapiens 


cDNA FLJ 10876 fis, clone 

NT7RP4001 weaklv similar to 

IN 1 Z/XV1 T^L/vyi O _> O , W t/Clxvl V olllAlldl LVJ 

Homo sapiens CoREST protein 
mRNA 


1205 


59 


1531 


AAB93285 


Homo sapiens 


Human protein sequence SEQ ID 
NO- 12336 


1205 


59 


1532 


gi2252814 


Mus musculus 


FOG 


2128 


70 


1 JjZ 




yVCIlOpUo IdCVla 


FripnH nfn ATA 


1 1SA 


49 


1532 


gi5733391 


Homo sapiens 


zinc finger protein FOG-2 mRNA, 

OUliipiC-Lt l>Uo. 


514 


37 


1533 


gi4079709 


Rattus 

• 

I1U1 V CglU Uo 


reggiel-1 


2106 


99 


1533 


gi4079711 


Rattus 

« 

norvegicub 


reggiel-2 


2063 


96 


1533 


gil3277550 


Homo sapiens 


Similar to flotillin 2, clone 

1V1V_JV^. JU J^, 1 1 JUL VI N /A , L/UllipiCIC l/Uo. 


1862 


99 


1534 


gil4249875 


Homo sapiens 


clone IMAGE:3 161752, mRNA, 

pal Llal vUo. 


3062 


98 


1534 


AAB93093 


Homo sapiens 


Human protein sequence SEQ ID 


452 


34 


1534 


AAB94716 


Homo sapiens 


Human protein sequence SEQ ID 
lsjO'l S774 

IN \J , 1 J i Zt'-r, 


299 


41 


1535 


gil4249875 


Homo sapiens 


clone IMAGE:3 161752, mRNA, 

pal llal LUb, 


3091 


100 


1535 


AAB93093 


Homo sapiens 


Human protein sequence SEQ ID 

"MO-1 1 041 
i > \j . 1 1 yH i . 


480 


35 


j. «j «j 


AAB94716 


1XU111W OuUlvXXO 


Human t> rote in seauence SEO ID 
NO: 15724. 


299 


41 


1536 


gi7655 


Drosophila 
melanogaster 


betaH spectrin 


127 


28 


1536 


gil61334 


Lytechinus 
variegatus 


alpha-spectrin 


120 


28 


1536 


gil495198 


Rattus 
norvegicus 


alphall spectrin 


122 


29 


1537 


gi5669894 


Human 


latent nuclear antigen 


127 


24 
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M.U 


Hit ID 


Species 
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S 

score 


Percent 
identity 






herpesvirus 8 








1 ^7 




T-Tnmn c ntri f*y\ c 
xxuxxxo oauitiio 


«?er/ar&~rich nre-mRNA splicing 
factor SR-A1 (SR-A1) gene, 
complete cds. 


136 


25 


1537 


gi 10440402 


Homo sapiens 


mRNA for FLJ00034 protein, partial 
cds 


136 


25 


1538 


gil2654495 


Homo sapiens 


clone IMAGE:2822295, mRNA, 
partial cds. 


829 


100 


IjjO 


glJ DAAD D 


pictus 


ii \J yj umn v_> 


146 


51 


1 


ai 1 1 1 0 1 074 


XJ dOXXX Lib 

circulans 


BtrG 


88 


34 


1539 


gi 10440 109 


Homo sapiens 


cDNA: FLJ23425 fis, clone 
HEP22862. 


3086 


99 


ID jy 


rr^Q1 1 QSA 

gioy 1 1 yo^f 


riuiiiu od.pi.CIxt> 


nYRNA- rDNA DKF7n434E0335 
(from clone DKFZp434E0335); 

L/dl IXU.X LUo. 


1764 

x / v r 


100 




A AV41 SIR 


XTUIllU odpiOlxo 


T-iYaompTit of* Tin man «;pr/rfitp,d Tvrntein 

encoded by gene 77. 


1172 

X X t J—t 


99 




rri 1 HA^^^OQ 

giiu^oDDyy 


XJ-OrilU bcipiCIlb 


HDNA FT 11^56 fk Hone 

L/XylN jT\. X J-jJ i J J J U XXo, ^/HJllt/ 

PLACE1007705, moderately similar 
tn IVTiis musculus mRNA for Ndrl 
related protein Ndr3. 


1488 


100 


1540 


AAB 95462 


Homo sapiens 


Human protein sequence SEQ ID 
NO* 17944 


1488 


100 


1540 


gil2083721 


Homo sapiens 


mRNA for NDRG3, complete cds. 


1246 


99 


1541 


gil4089673 


Mycoplasma 
pulmonis 


50S RIBOSOMAL PROTEIN L35 


229 


74 


l i 


gl J. D \J I J U 


IVXyt/Upiclalllct 

fermentans 


T"iV"»r»QfYm£i 1 nrntpin T ^ S 

1 iUtJoUilioX LJX VJ LC-XXX X-/J,J 


213 


69 


1 ^zti 


ml AIAIA 1 ^ 1 


O Ldpiiy lULULLUa 

aureus subsp. 

shitpiiq "N/TnSO 

<X L1X Li O IVXUJv 


SO^l rirSoQom?il nrntpin T 

JUu llUUoUll 1C11 UlUlvlU JjJ J 


175 


58 


1 


gll jji-juou 


TTmnn Qnni pn c 


Similar to RIKEN cDNA 

kJXlliXXvll t\J XV11VJ — /J. > VX-'I liT. 

4931428F02 eene clone 
MGC: 14797 IMAGE:4064169, 
mRNA, complete cds. 


750 


97 


1542 


AAY07081 


Homo sapiens 


Renal cancer associated antigen 
nrecursor seauence. 


267 


38 


1542 


gi7020359 


Homo sapiens 


cDNA FLJ20333 fis, clone 
HEP 11252 


206 


31 


1543 


gi4007988 


Mus musculus 


D6MM5e protein 


1651 


74 


1 $A1 


glH-UUO\JU'+ 


1YJLU.O IIlU.aL'U.lU.a 


X_/U1V11VX_>X_/ |JHJLC1XX 


1651 

x vy x 


74 


1543 


gi6010389 


Enterobacteria 
phage Mu 


gpl5 


64 


26 


1544 


AAE03933 


Homo sapiens 


Human gene 36 encoded secreted 
protein HAUCC84, SEQ ID NO: 96. 


62 


32 


1544 


AAE03976 


Homo sapiens 


Human gene 36 encoded secreted 
protein fragment, SEQ ID NO: 168. 


62 


32 


1544 


gil 652299 


Synechocystis 
sp. PCC 6803 


DNA ligase 


82 


32 


1545 


gil3938461 


Homo sapiens 


Similar to RIKEN cDNA 
4933428D01 gene, clone 
MGC:16186 IMAGE:3637472, 
mRNA, complete cds. 


938 


100 
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S 
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1545 


gi3688589 


Triticum 

dco UV UII1 


MADS box transcription factor 


94 


30 


1 f /I c 


giy^o i ouv 


xlOl UCUIU 

vulgare 


\fi A T^) ^»-V»r»v -nrntpiti *5 

lVLrVX-/0 UUA LI HJ It'll! ~J 


93 


30 


1 C A £ 

1 Mo 


1 A/1 ^ 07 1 £ 
gllU4J$Z / 10 


no mo sapiens 


rTYNFA FT T1 14^7 fi«? clone 

L/Jl. 'lNxA. 1 JL-/J 1 1*"T.J / llo, t<iv/iiv> 

HEMBA1001522. 


816 


58 


1 C A 




rionio sapiens 


T-Tumcin nrntpin 5Pniipnr,P S1FO TTj 

JJL Ulllclll LltC- 111 aCk|UWlv*-' k_>l_/V^ 1J_-* 

NO: 16290. 


816 


58 


1546 


gil3122230 


Leishmania 
major 


CDC27/NUC2 -related protein 


110 


28 


1547 


gil4150461 


Mus musculus 


Osal nuclear protein 


827 


36 


1 j4 / 


glJOoyjDD 


rlOmO SapiGIla 


npnp fnr R 1 0CS pvnn 1 7 an fl 

i^CllC 1U1 D GA.VA11 1 / ClllAJ. 

complete cds. 


817 


36 


1Z>4/ 


/>ii /i 1 m 
gil4I jDZjy 


riomo sapiens 


IIlivlN .r\ lUi u VV1 IClatcu. jjiULtsiii, 

complete cds. 


821 


36 


i r /to 

1 j4o 


gi/Uzzy / i 


xiomo sapiens 


rTYNTA FT T 10748 fi<? clone 

IN .rv i. l^J 1 vy / *rO llo, ^/ivlllt' 

NT2RP3001819, weakly similar to 
RTNG CANAL PROTEIN 


578 


43 


1548 


AAB93127 


Homo sapiens 


Human protein sequence SEQ ID 
NO- 1201^ 


578 


43 


1548 


AAB92953 


Homo sapiens 


Human protein sequence SEQ ID 
NO-1 1 635 


234 


27 


1549 


gil0438282 


Homo sapiens 


cDNA: FLJ2203 1 fis, clone 
HFP08734 

1 IJLyJT \J O / Jt. 


1434 


100 


1549 


gi9837427 


Lytechinus 
vanegaius 


embryonic blastocoelar extracellular 

ilia in a. pi u icm |ji cviu owi 


278 


29 

i 


1549 


gil002808 


Lytechinus 
variegarus 


extracellular matrix protein 


152 


25 


1550 


gi7288419 


Treponema 
paiiiQum 


tpr protein K 


68 


48 


1550 


gil 1095629 


Treponema 
pdiiitium. 


TprK 


64 


45 


1550 


gi7288427 


Treponema 
pallidum 


tpr protein K 


63 


45 


1551 


gil 5026770 


Clostridium 
aceioouiy iicum 


Transcriptional regulator, 

lv±ai ISJ Ll/LilL iv laiiniy 


57 


42 


1551 


gi213301 


Heterodontus 
francisci 


H-chain V-region precursor 


60 


48 


1551 


gioiyyi 


Heterooontus 
francisci 


Trr Viaoir^r /"« l"l O "i Tl /'\/ll T^Vl Til 1 

ig neavy cnain ^ vn-j_^ii-jii^ 


60 


48 


1552 


i o/cc^ mi 
gllZOJDiyi 


Homo sapiens 


giuiauiione uransierabe z>eia i 
(maleylacetoacetate isomerase), 

Clone lvxvjr^.z.vz.!7 livi^rvvj JC/ . d Ljyyjy^, 

mRNA, complete cds. 




89 


1 c c o 

1552 


glJMU / J / 


riomo sapiens 


giutaxnione uransierase zeLa i 
(GSTZ1) gene, exon 9 and complete 
cds. 




89 


1552 


gi2832731 


Homo sapiens 


mRNA for maleylacetoacetate 
isomerase. 


529 


89 


1553 


gil0435720 


Homo sapiens 


cDNA FLJ13646 fis, clone 
PLACE1011325. 


858 


100 


1553 


AAB94701 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15686. 


858 


100 


1553 


gi9438179 


Homo sapiens 


inner centromere protein INCENP 
mRNA, complete cds. 


186 


26 
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ID 


fill 111 




T)p5f»r inf ion 


s 

score 


Percent 
identity 


1 J J'f 


gl.7.7 / i 1 Jo 


xHJxlxU bcipiClib 


GTPRP? mRNA for GTP-bindine 
like protein 2, complete cds. 


1286 


99 


1 J j4 { 


r^QQ7 1 1 ^ 

giyy / iijo 


IVXllo 111U&VU.1U.O 


rrTP-ViinHino - lilre nrotein 2 


1257 


98 


1554 


gi7020514 


Homo sapiens 


cDNA FLJ20423 fis, clone 
KAT09589 


1168 


99 


1555 


gil5 155438 


Agrobacterium 

lUlJULeidC lCIla 


AGR_C_894p 


192 


39 


1 J JJ 


gljUoU DZ:? 


oCmZObdOOlld.10111 

yces pombe 


olt'-pi Tirvmol rvo"-T>o<5Qil*>1 P HfltTlflP'P 
ciIaD lHJ/lIiiJiiJ'g "puooiuiv vacixiKA^,^ 

repair protein 


80 


27 


1 JJJ 


glolOJO / / 


JC/Ilt alTIO C D a 

histolytica 


f»nVicinr , PT' l^inrliTicr t^TrttiPlTl— 1 " T^"RT^1 
ClJilallCCl UlilU.lJ.lg UylULClll X, XjJJX 1 


101 

X \J X 


27 


1 J JO 


gi4 / y 1 / 1 


riomo sapiens 


Jl.oapiCIlb llCxLZ IlilViN iA. HJI LyiUlCIll 
Vino cp 


1921 

X Z? X 


87 


1556 


gi507875 


Homo sapiens 


Human NIMA-like protein kinase 1 

/TsTT K 1 ^ mRTsT A nomnlete cds 


1921 


87 


1 




T-Tf\TYin C O T~V1 P Tl C 

x luintj oa|JiC/iio 


i j. mucin ixix^x uiuii/Ui. 


1921 


87 


1 ^7 
1 JJ / 


cri47Q 1 7 1 

gl^T / 7l / 1 


T- Tp»Tnr\ c 5i t\i pn q 
XxtJiliO bctpiCixb 


T-T <janipn<? npl<"7 mRNA for Drotein 

kinase. 


1948 


89 


1 SS7 
1 JJ / 


glJU / O / J 


XTxUIiiXJ odlJidlO 


Human T^TlVTA-lilce "nrotein kinase 1 
(NLK1) mRNA, complete cds. 


1948 


89 


1 SS7 
1 J J / 


X 7ZJJU 


XxUiliVJ baJJlOlXa 


TTnTnan TsJlTC 1 r>rotpin 


1948 


89 


1558 


gi 1043 8 880 


Homo sapiens 


cDNA: FLJ22476 fis, clone 
HRP 10682 


2605 


100 


1558 


gil3469818 


Mus musculus 


bicaudal-C 


2179 


89 


1 J jo 


gl / oUU I OU 


^VCIlOpUo laCVlb 


V*i p d n (\ a 1 — 


1720 


73 ! 


i j jy 


gl 1 UHO DUO J 


nonio bctpiciib 


rDNA FT T1389^ fis clone 
THYRO1001661. 


1979 


99 


1 ^^Q 

1 J J7 


/Vr\_D J O \J O 


xxuiiiu bcipiciio 


1-TnmnTi r\i*o1p in cphiipupp ST^O TT) 

XXUlllClIl yjl.\JL\^/lxl OVLjU^llL-U kJl— /V^ IX/ 

NO- 18306 


1979 

X S 1 ^ 


99 


1559 


gi 10434804 


Homo sapiens 


cDNA FLJ12998 fis, clone 
NT2RP3000267 


1847 


98 


1560 


AAB33840 


Homo sapiens 


Human secreted protein BLAST 
search protein SEQ ID NO: 184. 


65 


28 


1 


gn j iou^jv 


iviyzus persiodc 


uy lUL/iiiuiiic p*+jv/ 


64 


33 


1560 


gi28435 


Homo sapiens 


Human mRNA for apoferritin H 

r» ri q i i"\7T%p 

l/llaxll type. 


79 


24 


1561 


gil4336760 


Homo sapiens 


16pl3.3 sequence section 7 of 8. 


2672 


75 


1 1 

1 JOi 


glJooyoU4 


urosopnua 
ineidiiogabiei 


cramp eu proicm 


274 


29 


1561 


gi557822 


Saccharomyces 
cerevisiae 


mal5 } stal, len: 1367, CAI: 0.3, 

/V1V1 x XI X i_/^rVkJ» 1 X wOVJH-VJ 

GT TTPOAMYT ASF SI CRC 3 2 1 3") 


184 


20 


1 ^67 
1 JOZ 


gl 1 U4j OJUH 


xxuiiiu bapiCIlb 


rDNA FT T1 41 24 fis clone 
MAMMA 1 00249 8 . 


896 


100 


1 ^£7 
1 JOZ 


/\/\XJ>" J / W / 


T— l*/~\ VY~1 ^ OQtMPMC 

XIOIIIU bctpiCllb 


TTnmfm nrr^fpin ipnnpnrp S1FO TT~) 

XXUlllClIl IJ1L/L\_/111 atU Ut/ll^'t/ LJJ_(V^ XJ_V 

NO* 18550 


896 


100 


1562 


gi 15026974 


Homo sapiens 


mRNA for obscurin (OBSCN gene). 


449 


96 


1563 


gi5882290 


Homo sapiens 


Ras guanine nucleotide exchange 
factor 2 mRNA, partial cds. 


6402 


100 


1563 


gi2522208 


Homo sapiens 


Ras-GRF2 mRNA, partial cds. 


6397 


99 


1563 


gi57665 


Rattus rattus 


P140 RAS-GRF 


4114 


65 


1564 


gi5882290 


Homo sapiens 


Ras guanine nucleotide exchange 
factor 2 mRNA, partial cds. 


6412 


100 


1564 


gi2522208 


Homo sapiens 


Ras-GRF2 mRNA, partial cds. 


6407 


99 


1564 


gi57665 


Rattus rattus 


PI 40 RAS-GRF 


4124 


65 
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S 


Percent 
identity 


1 C f c 

1565 


gll 4U4J J 


riomo sapiens 


protein pnospiiaiassc i, icguidiuiy 

/in iriV»i frir^ QiiVnim't 9 done 

1 ilXlllUllAJl J oU-ULillll -t- 3 ^luiivj 

MGC:1327 IMAGE: 3 346 5 73, 

ml? 7\[ A pATtrnlptP cds 


503 


79 


1565 


gi4704218 


Homo sapiens 


mRNA for inhibitor 2 of protein 
phosphatase 1. 


503 


79 


1 JOJ 


gl4 /4jo(5 


nomo sapiens 


T-T cortipnc mTJTsIA for inhibitor 9 

n. bdUlCllb lIXXVl N XT, XVJ1 XXXXXX U J. LUl ^ 

gene. 


503 

»v vy 


79 


1 

OOO 


rri'*99R 1 1 £ 


"\flno t^tii o/"»i l li 1 o 

ivius liiubcuiub 


CC rfipmnVine ARPD-1 

V_/V_/ \j 11C lllvJlvl 1IC /UJvI/ A 


15 


25 


1566 


gi45S5823 


Mus musculus 


CC chemokine DC/B-CK 


75 


25 


1 JOO 


gij /_> /oy^ 


ivius muscuius 


ma r»i*r~\t^1"\ r\ cr —f\ F*ir\\7F k f^ f*Tlf a TT10K 1TIP 
IIlctL/I UUIlcigC _ LlC/l 1 VCU ^HC'llUJlVJ.llt' 

MDC/ABCD-1 


75 


25 


1 JO / 


giooUo/y / 


riomo sapiens 


IIirvlNxT., JJJ\.rZjp^J^lJWU I u 

(from clone DKFZp434B0610); 

jJaiucti i>Uo> 


997 


100 


1 JO / 


gll^O/O / / J 


V^illdIIxyU.OlIlUlldb 

reinhardtii 


rru pi \j lqiii 


130 


27 


1 JO / 


gll4-ZDU4ZO 


nomo sapiens 


C1U11C llVJ_rt. vJJC/ . D OUUii JO, IIJ-LVINZT., 
Udi.lla.1 l/UOi 


111 

XXX 


26 

Z^ V.' 


1 JOo 


gl 1 U^f J OH 1 Z 


nUillU bdjJICllb 


rDNA FT T14050 fi<; clone 
HEMBA 100 6796 


1641 


100 


1568 


AAB95872 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18952. 


1641 


100 


1 JOo 


glZ^OOO / O 


lViyLllLlb CU.U11D 


nrpp 0 1 1 3 O F*Tl P 
piCtUllagCll X 


146 


30 


1569 


gil3603867 


Homo sapiens 


ferritin heavy polypeptide-like 17 
/pnpTjT i 7\ rnRNA comnlete cds 


575 


72 


1569 


gi2 11774 


Gallus gallus 


ferritin H subunit 


527 


63 


i joy 


gizooy co i 


OallUS gdllUb 


-fp> T*i—j fi n T- T r*TiciTn 
1C111L111 1 1 L/lldlll 


527 


63 


1570 


gil4787176 


Mus musculus 


CSMD1 


1576 


68 


1 J /U 


gll4/o/lol 


riomo sapiens 


V^KJD dllLl bUolll lllLLxLipiC IXUllldlllo 

protein 1 short form mRNA, 
complete cds, alternatively spliced. 


1 571 

X ^ / X 


68 


1 J /U 


gl!4 /y4 /ZO 


nomo sapiens 


f T 1 UI anH cnclii Tnn1'Hf>1f a HomfiinQ 1 

V_z \_J [) clllU O Li O XIX 111LX1 LXLJXt? LXV^XXX£XXlXo X 

■nrntpin mR"NTA rriTTiTilfitf* cc\^ 

L.'lv'L^XXl XXXXN-L ^( X^. , VWiliUiVlv vUJ • 


1571 


68 


1571 


gi7019911 


Homo sapiens 


cDNA FLJ20059 fis, clone 
COT 01349 


533 


38 


1 1 

1 J / 1 


gllU<+*+ 1 / D\J 


nomo adpiciib 


kelrTi-like tirotein C3TP1 mRNA 
complete cds. 


447 


35 


1 ^71 
1 J / 1 


gl 1 H-UH-Z'ti/O 


nomo odpiexib 


rDNA FT T147S0 fi<; clone 

Ul/lNiV 1 X^< J X-T - / *J\J XXo, V/XWXXW 

NT?RP^002948 wealclv similar to 

l> i ^jlxjt J V/va»7"0) wt/divxy oxxxxxxcix i\j 

RING CANAL PROTEIN 


447 


35 


1572 


gil 5 145793 


Sus scrofa 


basic proline-rich protein 


405 


32 


1 ^99 
10 /Z 


gl J J>U J J D D 


IViy CO DaL IC1 lull J. 

til Vl f^TT*! 1 1 r» <3 1 c 
LU.U CI \y UiU Olb 


piLlllllC-llOll 1111X^111 XHJlXXUXUg 


386 
S \j \j 


30 


1572 


gi5917666 


Zea mays 


extensin-like protein 


386 


30 


i j / j 


gl 1 J7 J jO 


nomo sapienb 


rTiNA- FT T99QR6 fie rlnne 

{sLJ r\. rJL/JZZ70U lib j V.'lUllt' 

KAT11742. 


456 


96 


1573 


gi31303 


Homo sapiens 


H. sapiens fau mRNA. 


456 


96 


1573 


gi31305 


Homo sapiens 


H. sapiens fau 1 gene. 


456 


96 


1574 


gil0438123 


Homo sapiens 


cDNA: FLJ21916 fis, clone 
HEP03994. 


1187 


91 


1574 


gi6841176 


Homo sapiens 


HSPC263 


550 


46 


1574 


gil3960159 


Homo sapiens 


clone MGC:4584 IMAGE: 3 05 1266, 
mRNA, complete cds. 


550 


46 


1575 


gi2982673 


Homo sapiens 


mRNA for p27, complete cds. 


1033 


100 
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S 
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Percent 
identity 


1575 


gi6469047 


Mus musculus 


C184S protein 


925 


89 


1575 


AAY36056 


Homo sapiens 


Extended human secreted protein 
seauence SEO ID NO. 441. 


707 


91 


1576 


gil 1611473 


Mus musculus 


Deltex3 


517 


51 


1 

13/0 






Human Nntrh qienallinff protein 
Deltex (hZDX V4 


374 


48 


1576 


AAY70439 


Homo sapiens 


Human Notch signalling protein, 
Deltex fhZDXV3 


374 


48 


1577 


gi 14042 190 


Homo sapiens 


cDNA FLJ 14576 fis, clone 
NT2RM4001092 weakly similar to 
ZINC FINGER PROTEIN GL03. 


2615 


96 


1 ^77 
l j / / 


A ARQS1 ?Q 


Mnmn Qanipn^ 


Human nrotein seauence SEO ID 
NO:17133. 


2615 


96 


1577 


gi9651765 


Mus musculus 


zinc finger protein 289 


2450 


90 


1 S7R 


cnQfi97? 1 Q 


norvegicus 


beta-catenin binding protein 


2550 


95 


1 ^7R 




Rattus 
norvegicus 


multidomain presynaptic cvtomatrix 
protein Piccolo 


171 


27 


1 578 

1 J / o 


p-i 1666689 


A/Tus musculus 


aloha-NAC. muscle-specific form 
ep220 

or u 


174 


25 


1579 


ei3 868802 


Mus musculus 


c29 


1708 


74 


1579 


gil2642308 


Ovis aries 


type I keratin intermediate filament 
IRSal 


1668 


72 


1579 


AAB56803 


Homo sapiens 


Human prostate cancer antigen 
nrotein seauence SEO ID NO: 1381. 


1215 


62 


1580 


gi6 164743 


Homo sapiens 


F-box protein Fbx20 (FBX20) 
mRNA, partial cds. 


1002 


100 


1 ^sn 

1 JoVJ 


A AYS^OO^ 

/Vr\ I OJV7J 




F-hnx -nrotein FBP-25 


1002 


100 


1580 


AAY83072 


Homo sapiens 


F-box motif of FBP-25 . 


261 


100 


1 <Q 1 


glzOZZJo 1 


ivie inanoineirno o 
acter 

UlCl 1 1 la ULU U. UJJIlll/ 
US 


L/UIlbCIVCU. pUJlClll 


115 


37 


1 Do 1 




ryruoutLUS) 

abyssi 


HYDROXY AC YLGLUTATHIONE 
HYDROLASE related (EC 3.1.2.6) 
(GLYOXALASE ID 


84 


26 


1581 


AAG04045 


Homo sapiens 


Human secreted protein, SEQ ID 
NO- 8126. 


71 


33 


1582 


gil2655474 


Homo sapiens 


mRNA for keratin associated protein 
9 8 CKRTAP9.8 eenel 


960 


94 


1582 


gil2655470 


Homo sapiens 


mRNA for keratin associated protein 
9 4 (KRTAP9 4 eene) 


931 


93 


1582 


gil2655466 


Homo sapiens 


mRNA for keratin associated protein 
9 2 CKRTAP9 2 eenei 


895 


85 


1583 


gi515644 


Homo sapiens 


Human nucleotide-binding protein 


1112 


98 


1583 

X -J \J -mJ 


gi6018193 


Mus musculus 


nucleotide-binding protein long 
form 


1022 


88 


1583 


AAY73353 


Homo sapiens 


HTRM clone 1870914 protein 
sequence. 


880 


94 


1584 


gil0177622 


Arabidopsis 
thaliana 


gene_id:K6M13.11~ 


190 


36 


1584 


gi499199 


Schizosaccharom 
yces pombe 


uvi22 


157 


33 


1584 


gi3 184086 


Schizosaccharom 


uv-induced protein uvi22 


157 


33 



t 
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SEQ 

TT* 
LD 


Hit ID 


Species 


Description 




JL CI LClll 

identitv 






yces pombe 








LDOD 


rr49fi79/'l''> ^ 

gizu /Z4Z0 


riomo sapiens 


numan non-iens ueia gaimiia- 
crystallin like protein (AIM1) 

IliXviN-rt., pd.1 Hal CUa. 


1 0^6 
1 w w w 


40 


1585 


gi6807694 


Homo sapiens 


mRNA; cDNA DKPZp434L1713 

/frnm r 1nnf* TYK'F7n4'34T 171^V 

partial cds. 


1044 


100 


1585 


gil62917 


Bos taurus 


gamma-B-crystallin 


319 


36 


15oO 


AACjr033o J. 


riomo sapiens 


xiuman secretea protein, ori^ AAJ 
NO: 7442. 


ZUH 


07 

y i 


1 co/: 
1 JOO 


gl5 /0o3 /4 


uniuentiiiea 

DaCUlO virus 


rlrirNJD V -yvl/v 


77 




1586 


gil3195754 


Homo sapiens 


protein tyrosine phosphatase mRNA, 

pdlllcU CUS. 


70 


24 


1587 


gi32 13225 


Homo sapiens 


T-box-containing transcriptional 
activator (TBX10) gene, exon 8, and 
pamai cas. 


953 


97 


1587 


gil4289455 


Homo sapiens 


T-box 1 transcription factor C 
^iJr>A.ii^j n.iiviN/\, complete cas. 


799 


62 


1587 


gil2620817 


Mus musculus 


T-box 1 


786 


67 


1 c o o 
1556 


a a ^ r i Tm 

AAY 1 /zz / 


Homo sapiens 


iruman secretea protein ^cione yai- 


1 on 


Zj 


1 CQQ 


A A VQ/fOlfi 

AA y y^yjo 


riomo sapiens 


riuman secretea proiem cione 
ye78 1 protein sequence SEQ ID 
NO:82. 


y j> 


_> w 


1 

1 JOO 


rri^l 7HQ 


virus 


crckcr i~\ 1 "\ rt^\ f " p» m • nvntpi ti t\ 1 0 ins 

ln84) (652 is 1st base in codon) 


61 

U 1 


34 


1589 


gil0439883 


Homo sapiens 


cDNA: FLJ23259 fis, clone 

PAT 0^770 


2480 


99 


1589 


gi 16463 9 


Sus scrofa 


ribonuclease inhibitor 


269 


30 


1589 


gil4577933 


Mus musculus 


ribonuclease/angiogenesis inhibitor 


111 


31 


1590 


gil309720/ 


Homo sapiens 


riDosomai protem, large, Jr i , clone 
MGC:5215 IMAGE:2900846, 
rnxviN/v, complete cas. 


D /O 


7zl 


1590 


gil4043204 


Homo sapiens 


ribosomal protein, large, P 1 , clone 

IVIOL/. 1 jOIO JlVl/vljrii. jj^jyJZ 1, 

rnRNA, complete cds. 


376 


74 


i cha 

15y0 


glly0234 


riomo sapiens 


xiuman aciaic riDosomai 
phosphoproteinPl mRNA, complete 
cas. 


J /O 


/ i r 


1591 


AAW49700 


Homo sapiens 


Human flavin-containing mono- 
oxygenase lsoiorm x. 


2197 


96 


1591 


gil2006730 


Rattus 
norvegicus 


flavin-containing monooxygenase 

r JVlvJJ 


2127 


72 


1591 


gi349534 


Oryctolagus 
cunicuius 


flavin-containing monooxygenase 
r'lvivjj 


2121 


72 




AAW4Q700 


XAvjiiiw odiJit/iio 


T-Tnman flavin-pnntai'ni'no' mnno- 

XALllllClll JLld V All w W 11 1, till 111 J. g, 111VJ11U 

oxygenase isoform x. 


2197 


96 


1592 


gil2006730 


Rattus 
norvegicus 


flavin-containing monooxygenase 
FM03 


2127 


72 


1592 


gi349534 


Oryctolagus 
cunicuius 


flavin-containing monooxygenase 
FM03 


2121 


72 


1593 


gil0439603 


Homo sapiens 


cDNA: FLJ23040 fis, clone 
LNG02277. 


1793 


99 


1593 


gi6633816 


Arabidopsis 


F1N19.14 


141 


26 
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TD 


Hit ID 


Species 
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S 
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Percent 
identity 






thaliana 








1J7J 


cri*^ 1 446 


"Ti i r* t~\ tc\ c tf* 1 in tti 

l_y 1 U IV Uo LC 11 Llllx 

discoideum 


Tvrotein Viua^ie 

L/ X \J lAs XXX XVXIXCI O V/ 


138 


22 


1 ^04 


ml SOR04^R 


xxUlIJlU oajJxCilo 


clone MGC-9275 IMAGE' 3 866562 
mRNA, complete cds. 


1911 


100 


1 


g,l I JUOl'tjJ 


xxUlllU oaXJlEsllo 


clone 1X400*20227 
IMAGE.4558235, mRNA, complete 
cds 


1911 


100 


1594 


AAB92569 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10777. 

X 1 * * -*■ firm 


1911 


100 


1595 


gil0435288 


Homo sapiens 


cDNA FLJ13315 fis, clone 
OVARC 100 1525. 


736 


100 


1595 


AAB94572 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15358. 


736 


100 


1596 


AAG02673 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6754. 


359 


100 


1596 


si9916 


Plasmodium 
falciparum 


liver stage antigen 


339 


22 


1 596 


£±3549261 


Dictvostelium 
discoideum 


interactin 

XXX h>%^X V* Mil 


203 


18 


1 597 


eil4993576 


Di ctvostelium 

X-^ X v IV LVAlUlll 

discoideum 


deliriumA 

U.VXJ.X X Wi-X X A^L 


218 


36 


1597 


gi6626184 


Medicago sativa 

^nh^n x varia 


Ran GTPase activating protein 


196 


29 


1597 


gi201910 


Mus musculus 


Tcte-1 peptide 


183 


32 


1 SQR 




xxuiiiu aaxjit/iio 


Human secreted nrotein SEO ID 

X X L4.lxXC4.xx ljv wx w LV-/\_x Ul U Lvillj kJX—/x^ xx—' 

NO: 7428. 


219 


97 


1 SQR 




1-fnrnn co-nipTi <i 

XXU1X1VJ oaLJ IV/IXO 


Human Bl 1 Ap^I antiffen snlice 

xxujixxcvxx jj i x jr\.£y *■ ttiiii&v/u juiiw 

isoformBllC-8. 


112 


40 


1 SQR 

1 J .7 0 


A AYR70 1 R 


Hrvmo QfiniPiiQ 

XX\J111VJ' odLJltllO 


Human immunogenic nrostate 
tumour protein sequence SEQ ID 
NO:379. 


112 


40 


1599 


AAB08765 


Homo sapiens 


A human leukocyte and blood 
related protein fLBAP). 


280 


56 


1599 


AAB74718 


Homo saniens 


Human membrane associated 
protein MEMAP-24. 


267 


60 


1599 


AAG01306 

X X.X jjV. V > V./ X w' \J \J 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5387. 


256 


73 


1600 


eil 1385354 


Homo sapiens 

X AvlllV/ kJ *rV L/1V1AL' 


BAF180 (BAF180) mRNA, 
complete cds. 


8308 


100 


1600 

X \J \J 


eil2083875 


Homo sapiens 


polvbromo-1 CPB1) mRNA, 
complete cds, alternatively spliced. 


5129 


96 


1600 




Crallim paling 


nolvbromo 1 protein 

L/ X y y~J X V-/ X 11V/ X L/ X v— XXX 


4742 


89 


1 601 

X \J\J X 


ail 0434904 


TTnmo ^anien^i 


cDNA FLJ13063 fis clone 
NT2RP3001678. 


2590 


99 


1601 


AAB94448 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15085. 


2590 


99 


1601 


gi 104403 69 


Homo sapiens 


mRNA for FLJ00019 protein, partial 
cds. 


2370 


99 


1603 


AAG01839 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5920. 


446 


95 


1603 


g i459443 


Gallus gallus 


pi 04 chicken Rb 


57 


53 


1603 


gi2829147 


Homo sapiens 


lymphocyte-specific protein 1 
(LSP1) gene, LSP1-5 allele, partial 
cds. 


57 


34 
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ID 


Jtllt ID 


opecies 


lies crip lion 


c 

score 


identity 


1 /Zf\A 

loU4 


gii iyyou4 


xiomo sapiens 


xiuman zinc linger proiem ^zxiz-z d 
mRNA, complete cds. 




1 00 


1 A/1 

10U4 


rri 1 ACiAIA 1 ^ 

gl!4U4Z4 10 


nomo sapiens 


curs A JrijJ 14 / iu us, cione 
NT2RP3000632, weakly similar to 

yTTMP FTNTfTFT? PT?OTFTTvJ 
Zvll >l \s i 11N vJI_/lv x XvvJ 1 JZ/JLTN O^r . 


1 ^74 

1 J / H 


S4 


1604 


AAB93075 


Homo sapiens 


Human protein sequence SEQ ID 

"MTV 1 1 QH9 


1574 


54 


1605 


gi22380 


Zea mays 


CAAT-box DNA binding protein 
suDumt -ts ^rsr - 1 Jts ) 


73 


34 


1605 


AAB52040 


Homo sapiens 


Human secreted protein sequence 
encooeo oy gene ony ijl/ in kj. oy. 


56 


29 


1605 


gi263304 


Bos taurus 


insulin-like growth factor-binding 
piotem 4 j lVjJrr>±^ i f 


71 


37 


1 <A/^ 

loUo 


gll4uUZO /4 


nomo sapiens 


Clone 11VJL/\VjJC/.^!7Z!7DOU, IIlJLviN/\, 

partial cds. 


1 £0 




1606 


gi6537256 


Anopheles 

mini a 

gamoiae 


Toll-related protein 


104 


27 


1 £A/£ 

loUo 


gl4oo4j5Z 


ocnistocerca 
americana 


i oil protern 


o I 




1 

IoU / 


gi4yzy / u i 


nomo sapiens 


(~^C~1A 1 1 /> rvrnfai ^ ml? "NT A r» v\ 1 /=»'f"^» 

^vjri-i io proiem rrusj-N/A., compicLc 
cds. 


719 
/ 1 z, 




10U / 


gl / O / /Uo4 


nomo sapiens 


numan protein x vwy iijulvin/a.. 


719 
/ ix 




luU / 


rr^ 1 ^ai 1 Qon 


riomo sapiens 


oimiiar io v_^vjri- 1 10 pioicixi, cionc 
lVrnP-l is^fi TA/rArrF-4977^8^ 

mRNA, complete cds. 


707 


o 


1608 


gil 1640582 


Homo sapiens 


MSTP037 mRNA, complete cds. 


687 


100 


1 AOS 
1 OUo 




iviy co uac leriuiii 
tuberculosis 


pi\JllllC-IlCH IIlUClll lltllllUliJg 


1^7 


40 


1608 


gi60 15454 


Hylobates 
mueiieri 


dopamine receptor D4 


125 


37 


1609 


gi2138290 


Homo sapiens 


FLU gene, complete cds. 


6603 


98 


< /z r\c\ 

1609 


gi869<561o 


Mus musculus 


rliixi piotern 


UOUU 


7J 


1 (1 c\c\ 

1609 


A A "D CO OA/I 

AAB5oo94 


Homo sapiens 


JbJreast ana ovarian cancer associatea 
antigen protein sequence SEQ ID 
602. 


4 lOl 


OA 

yo 


1610 


gi4808631 


Homo sapiens 


transcription factor-like protein 
JV1K.O-A. ^ivirCLjA. ) mKJNA, complete 
cds. 


532 


94 


1 C 1 A 

1610 


rriOOOCO 1 A 

giooyoziU 


Homo sapiens 


cnromosome a ivioIjj-z protein 
mRNA, complete cds. 


D jZ 




1 tZ 1 A 

1610 


• r An 1 r c 1 

gij931553 


Mus musculus 


olcL593p 


4oZ 


OJ 


1611 


gi!0434431 


Homo sapiens 


cDNA FLJ 12752 fis, clone 
NT2RP2001 174, weakly similar to 
GASTRULA ZINC FINGER 
rKUlJilJN Al^L,Cjr4o. 1. 


390 


53 


1611 


AAB94280 


Homo sapiens 


Human protein sequence SEQ ID 

"NTO-1471 1 


390 


53 


1611 


gil4330448 


Homo sapiens 


mRNA for zinc finger protein 
RINZF (RINZF gene). 


390 


53 


1612 


gil47 14943 


Homo sapiens 


Similar to outer dense fiber of 
sperm tails 2, clone MGQ9034 
IMAGE:3 874501, mRNA, complete 
cds. 


480 


26 


1612 


gi8886477 


Rattus 
norvegicus 


cenexin 2 


389 


25 
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ID 


Hit ID 


Species 


Description 


S 

StUI c 


Percent 

1UC11 LI IV 


1612 


gi2996006 


Homo sapiens 


outer dense fiber protein 2/2 mRNA, 
complete cus. 


345 


24 


1613 


gil47 14943 


Homo sapiens 


Similar to outer dense fiber of 
sperm tails 2, clone MGC:9034 
iJVi/VLjii . j 0 /4DU1, niLviN/v, complete 
cds. 


390 


24 


1613 


gioooo4 / / 


Rattus 
norvegicus 


cenexm z 




AD 


1 1 1 
lOl D 


gizyyouuu 


Jtcattus 
norvegicus 


outer uense iioer piotem ultoh 






1014 


gll dZ>h jyoo 


riomo sapiens 


cione ivivjv_x. i^uh- / 
IMAGE:3626506, mRNA, complete 






1614 


gil0177621 


Arabidopsis 
tnanana 


phytoene dehydrogenase-like 


1109 


49 


1614 


gil3424781 


Caulobacter 
crescentus 


phytoene dehydrogenase-related 
protem 


752 


37 


1615 


gil0435416 


Homo sapiens 


cDNA FLJ13409 fis, clone 

Jr JL/\v^ri i yju 1 / ID. 


2108 


100 


lol J 


A A "DO/1 


riomo sapiens 


riuman protem sequence ojj/V^ lu 
NO:15515. 


? 1 OR 


1 no 


1615 


gi4324457 


Schizosaccharom 
yces pornoe 


caffeine-induced death protein 1 


168 


30 


1616 


AAG04014 


Homo sapiens 


Human secreted protein, SEQ ID 


284 


92 


1616 


gi9622389 


Plasmodium 
iaiciparum 


variant surface protein 


62 


53 


1616 


AAY 19771 


Homo sapiens 


SEQ ID NO 489 from W09922243. 


54 


30 


1617 


gil3774109 


Homo sapiens 


mitochondria solute carrier protein 
^ivioLxJrj mtCiN/\, complete cas, 
alternatively spliced. 


151 


56 


101 / 


A a ti/^n/^s 
/\/\r> OUO j 0 


riomo sapiens 


numan mitocnonciriai soiuie carrier 
protein hMSC-o. 


1 S 1 

i j i 




lOI / 


nil ^.1 Q.1 
gl / Z> / O / OJ 


riomo sapiens 


n lUi j protem ysn lkjidj iiixvtn /v, 
complete cds. 


1 1 

i j i 




lol O 


gllZ /Z-5UUo 


.Lacto coccus 
lactis subsp. 
lacus 


U IN JsJN UW1N r^xvVJ 1 rLliN 




71 
/ 1 


1619 


gil0436353 


Homo sapiens 


cDNA FLJ 14007 fis, clone 

V7Q A A 1 nm/in7 


1454 


99 


1619 


AAB94885 


Homo sapiens 


Human protein sequence SEQ ID 
JNvJ. lolUO. 


1454 


99 


1619 


AAG01968 


Homo sapiens 


Human secreted protein, SEQ ID 

\TA. £0/10 


647 


94 


1620 


gi 1043 63 53 


Homo sapiens 


cDNA FLJ 14007 fis, clone 

Y /yAAlUUZ4U/. 


1335 


94 




An.D7toOJ 


riuiiiu betpiCXlo 


T-fnirmn Tvrr\'i7=»'iTi cpniiPTiPP SIhIi Ti) 

jnuiiidii uivi tcixx ocquciiv^c ojjv^ xjl/ 

NO:16106. 




94 


1620 


AAG01968 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6049. 


528 


83 


1621 


gil4701866 


Dictyostelium 
discoideum 


carmil 


307 


27 


1621 


gil881738 


Acanthamoeba 
castellanii 


myosin-I binding protein Acanl25 


303 


27 


1621 


AAR35072 


Homo sapiens 


Human t-complex associated testes 


119 


28 
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S 


Percent 
identity 








CA.JJ1 coovi'V-j. jji \J it/ 111 1, 






1622 


gil2053149 


Homo sapiens 


mRNA; cDNA DKFZp434G2226 
(from clone DKFZp434G2226); 


1178 


59 


1622 


gil881662 


Drosophila 

IllClaJUUgclo IC1 


kinesin like protein 67a 


928 


50 


1622 


gi3136023 


Schizosaccharom 

yccb puiiiDc 


kinesin-like protein 


850 


48 


1623 


gi6069583 


Mus rnusculus 


JNK-binding protein JNKBP1 


1564 


46 


1623 


gi5911879 


Homo sapiens 


mRNA; cDNA DKFZp434J046 
(from clone DKFZp434J046); 
partial oqs. 


1459 


88 


1623 


gil 1875643 


Myxococcus 


Bapl 


182 


38 


1624 


gil2007334 


Homo sapiens 


IRS-1 PH domain binding protein 
PHIP mRNA, complete cds. 


768 


59 


1 


rr-i 1 A9R6996 
gll4ZoOZZO 


no mo sapiens 


piecKsrrm nomoiogy uonidiii 

JUllCIavllIlg _piUlCJUl, L'lUIlC 

MGC- 1 5 1 87 TMAGE-3 830844 
mRNA, complete cds. 


768 




1 OZ4 


cri 1 9007^6 


lV/fllO TYIllCPlllllC 


FT? ^I- 1 iPTT rlnmain ViinHincr TvrnlT^iTi 

XJlVO l JT X A vlUlllCllll UlllLllllg jJl \J Lt^lll 

PHIP 


768 


59 


JL UwJ 


o-il 1 

gll UJOtOO 


JO.IXIJLJXJ OClJLJICllo 


mRTsJA for R-rpll 

llll\.l>^\. JLU1 JD — tCli 

lymplioma/leukaemia 1 IB (BCL1 IB 

gene;. 


4438 


100 


1625 


gil3094147 


Mus rnusculus 


zinc finger protein rnRitl beta 


4076 


93 


1 69 s 


cri7S467Cn 
gl / O t -x\J 1 ZrD 


A/Tnc Tvmcf nine 

1 VI Lib llJLtloULllLlb 


v^- 1 ±ir z< jji \j Lciii 


3719 




1626 


gi5689375 


Rattus 

I1UI VCglOUb 


tudor repeat associator with 

PPTATRF 9 


2512 


89 


1 696 


cmAsr 6460 

gl*+ J OOH-Ou 


JLlUIlJAJ bctpiC/Ilo 


1 1 JLLvl N fA. ±\JL LU-LlUl ICUCdL asaUtiaLUl 

with PCTAIRE 2, partial cds, 


94.Q8 

jL'-tZs O 


1 00 

1 \J\J 


1 696 


gl 1 J7 JOH-i/O 


n 0 riiu bdpienb 


dUUC llYLrvVjIi. J O JT-ZJ7 , IIlXViN/A., 
Lidl iidi LUo, 


14^1 


1 00 


1627 


AAG03904 


Homo sapiens 


Human secreted protein, SEQ ID 
NO- 7985 


469 


100 


1 627 




1JHJ111\_J oClJJltllO 


T T4/T T6-a<i«inriatpH RNA «?ri1ipinp^ 

factor (PRP3) mRNA, complete cds. 


469 


100 


1627 


ffil390^000 


TV/fiic iTiii^rn1n<; 


RIKEN cDNA 3632413F13 eene 


469 


100 


1628 


gill611734 


Homo sapiens 


GREBla (GREB1) mRNA, partial 

cHs altftTTiRtiv^l v ^nlir^H 

VU-O • UXLVlXlCiLl V vl V JUllVvUk 


1117 


42 


1628 


gi7264653 


Mus rnusculus 


Kiaa0575 


953 


50 


1 698 


cri 1 1 61 1 7^6 
gl JL 1 U 1 1 / j>o 


I — I r\ com ptic 
XT.UII1U bapiCIlb 


complete cds, alternatively spliced. 


Q1 3 


51 
—> i 


1629 


AAY94673 


Homo sapiens 


Human zsig83 protein sequence. 


948 


100 


1 69G 

i ozy 




xiomo sapiens 


riuman zsigo d mamre proxem 
sequence. 


R7£ 
o / O 


1 no 


1 699 


A A R6047S 


T—T/-\ yit r\ cqtai f^n c 
inuillw bdJJlCIlb 


T-TiTman r^^ll eve^e* 5iTir1 Tvrfili'fp'rii'rifYn 

-LlUlllall V^CIJL L/VdC dllU LyUJllXCicH-lLIll 

protein CCYPR-23, SEQ ID NO:23. 


1 97 


39 


1630 


gil4249894 


Homo sapiens 


clone IMAGE:3502329, mRNA, 
partial cds. 


1321 


96 


1630 


AAB12157 


Homo sapiens 


Hydrophobic domain protein from 
clone HP03 165 isolated from KB 
cells. 


1321 


96 


1630 


AAY71116 


Homo sapiens 


Human Hydrolase protein- 14 
(HYDRL-14). 


1321 


96 
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S 


Percent 

irfpntitv 


1631 


gil2483904 


Rattus 
norvegicus 


zinc Finger protein HIT-39 


2085 


87 


1 /Z1 1 

loi 1 


n\lA 1 70Q7 

gl341 /zy / 


nomo sapiens 


numan i^nromosome 10 x3/\vv ciunc 
CIT987SK-A-635H12, complete 
sequence. 


1 007 




1631 


gil 86774 


Homo sapiens 


Human Kruppel related zinc finger 
protein (HTF10) rnRNA, complete 

CQS. 


746 


51 


1632 


AAB24467 


Homo sapiens 


Human secreted protein sequence 
encoQcQ oy gene j>i onv^ ij-^ inw.z/z,. 


243 


34 


1 /en 


A A T? A 1 QQQ 


nomo sapiens 


Jrdriid.1 up±js.^ gene pruu. 


60 


^4 


1632 


AAR85926 


Homo sapiens 


Protein tyrosine-kinase bpTK3 
iragmeni. 


60 


34 


1633 


gil31 11847 


Homo sapiens 


COP9 homolog, clone MGC:1297 

11VLA.VJI}. jjUj/O / , mxvi>l/\, COIIipicLc 

cds. 


652 


94 




gil /3U2o4 


nomo sapiens 


KsKjry signaiosome suDunii i L/oxn i 
^v^oin i ) riir\J.N/\, coinpieLc tub. 




04 


1633 


AAW68581 


Homo sapiens 


Human COP9 protein. 


652 


94 


1634 


gllU43r> jZ4 


nomo sapiens 


cjjina rLJ ij^y l ns, cione 
PT A PP 1004774. 




1 00 

1 \J\J 


1634 


AAB94678 


Homo sapiens 


Human protein sequence SEQ ID 


1526 


100 


1634 


gil 90074 


Homo sapiens 


lysyl hydroxylase (PLOD) rnRNA, 
complete cds. 


141 


31 




glOO / J 1 1 J 


lvius rnus cuius 


poUU liallbv-zlipLlUllal CUlat/LUi Jivi x 


1477 


82 


1 


gl400ZDU 


ivius muscuxus 


molecular weight subunit (NF-H) 


XJKJ 


^0 


1 £1 ^ 
103 J 


glzUUUZZ 


ivius mus cuius 


neuroiiicimeni pruiem 




^0 


1030 


glJ DZ 13o 


jurosopniia 

iriClallOgaa LCI 


uropomyosm lboiorrii iz? 


140 


7^ 


1636 


gi3449364 


Homo sapiens 


intracellular hyaluronic acid binding 

-rYrr»ff*i-n /'TT-1 A T-^T^^ mTJTsJA POTTrnl p+p 

L'lULCill ^XXXrtJ_>JT J llii-Vi > jr\. ? ^Ulll^JlVLC 

cds. 


148 


25 


1 


A A W^QI AS 
VV J7 1 Oj 


nomo sapiens 


Wnmcin T? TT A \A \A r\rr\tp*\r\ 
XXlilllctll J\Xx/Tl1VJ.1V1 piuiciJLi. 


147 


25 


1637 


gil0834638 


Homo sapiens 


TBXX T-box containing protein 


2145 


100 


1637 


gi3 128382 


Mus musculus 


MmTbxl4 


831 


49 


153 / 


s^il 1 /COO /C 1 

gi3 loyzoi 


Mus musculus 


i-dox rranscription iacior 


R74 


4Q 


1638 


gil 1493744 


Drosophila 
melanogaster 


STATHMIN-19 


124 


23 


1638 


gil 1493734 


Drosophila 
melanogaster 


stathrnin-14 


124 


23 


1638 


gil 1493732 


Drosophila 
melanogaster 


stathmin-1 


116 


23 


1639 


gil2005487 


Homo sapiens 


NPD012 (NPD012) rnRNA, 


246 


27 


1639 


gil 043 6679 


Homo sapiens 


cDNA FLJ14262 fis, clone 
PLACE 100 1608. 


213 


27 


1639 


AAB95811 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18804. 


213 


27 


1640 


gil 1141507 


Homo sapiens 


beta protein 1 BP1 rnRNA, complete 
cds. 


1285 


100 


1640 


AAB72699 


Homo sapiens 


Human beta-globin gene repressor 
BP1. 


1285 


100 



WO 03/029271 



PCT/US02/30474 



204 
Table 2A 



ID 


Jtllt JLJJ 


Species 


jjes crip nun 


score 


Pprppnt 

identity 


l04U 


gnoo /ooy 


ivlus museums 


U1X / 


S7Q 


74 


1 /C /I 1 

1041 


gllU4:5 / 1 DO 


riomo sapiens 


r»TYM A • "PT T9 1 1 9/1 fiQ nlrkn^ 
CUlN/\. rl^JZl 1ZH- llo, L/IOI1C 

CAS05964. 


664 


99 


1641 


gil5026993 


Homo sapiens 


partial niRNA for MUC5AC protein 
^mucm gene, iviu^^/\v^^. 


66 


26 


1641 


gi4056437 


Arabidopsis 
tnanana 


Strong similarity to PFAM PF|00069 
riUKaiyoiic protein Kiiiat>u uuiiiain. 


83 


29 


104Z 


gloODDOo / 


riomo sapiens 


m p\TA . f>FlXT A TWCV7r\l f\lV 1 S 1 1 
IXLLvlN/V, CU J. > r\ LJiS^T /L,p 1 UZJC/ 1 Jl 1 

^frnrn rlnnp DKF7n762F1 511^1 




1 00 


1642 


gi6979930 


Homo sapiens 


Maml mRNA, partial cds. 


343 


31 


1642 


gill 127697 


Homo sapiens 


SYT/SSX4v fusion protein 
(SSXT/SSX4v fusion) mRNA, 
partial cas. 


164 


28 


1 £A 1 
104 D 


rri 1 AA 


v^jryza sativa 


partial 


77 


27 


104 d 


rri 1 9AQ9^9"9 

guzoy /D 1 1 


wryza sauva 


Ldbcin isJJlabc 11 ctxpila oUUUliit 


77 


27 


1644 


gi3298472 


Mus musculus 


zinc finger protein 


2662 


88 


1644 


gll/ODoo4D 


riomo sapiens 


neignoor or y-v- Kinase ancnonng 
protein 95, clone MGC:1206 
IMAGE:3504388, mRNA, complete 
cas. 


Jul 


D 1 


1644 


gi6688138 


Homo sapiens 


mRNA for LA95 protein. 


361 


31 


1645 


gi5a524z5 


Homo sapiens 


aouble nomeoDOX protem genes, 
complete cds. 


ZO 1 


A9 
'fZ 


1645 


gi841zU6 


Homo sapiens 


Human Jj4oz4o3 nomeoDox-iiKe 
gene, partial cas. 


9^£ 
Z DO 


A1 
4J 


1645 


gi 143 503 8 


Homo sapiens 


Human facioscapulohumeral 
muscular ays tropny yr oni^ j gene 
region, D4Z4 tandem repeat unit. 


254 


42 


1646 


AAE02058 


Homo sapiens 


Human four disulfide core domain 
(FDCD)-containing protein. 


607 


43 


lo4o 


gllzo!)!>4DZ 


riomo sapiens 


mK.fN/\ ior Keratin associatea protein 
4.7 (KRTAP4.7 gene). 


^9 


A9 


1646 


gi200962 


Mus musculus 


serine 1 ultra high sulfur protein 


551 


44 


1647 


gilUzol 106 


Kattus 

• 

norvegicus 


Kmesin iignt cnam ivivL^t 


/4-> 


on 


1 £.A 1 
104 / 


rri 1 A/til QA O 


Homo sapiens 


cuina rijj lzjo / us, cione 
MAMMA1002637, moderately 

bllinial lO J\Jtl>J£ji311> JUlVJTli.1 

CHAIN. 


IS 1 
jol 




1 &.AH 
104 / 


rri 1 A/ll/1 ^9A 


riomo sapiens 


ClyIN-r\ StLjJ lZo^tJ IIS, Clone 

1^ 1 ZJCSJl jL\J\JD D \J 1 , IllUUCIdtCiy allllilal 

to KINESIN LIGHT CHAIN. 


JOl 




1045 


A AV^Q71 9 


riomo sapiens 


oecretea protem j i -D7-r 1^1 . 


S69 


OR 

J? o 


1648 


gi3288566 


Oryzias latipes 


annexin maxl 


80 


32 


lo4o 


gllDU O /3U 


Sinorhizobium 

lllCilXxJ LI 


Jtl 1 Jr U 1 rlii 1 LKsJKL, 

TR ANSMFTVTRR ANF PROTEIN 


&A 
04 




1649 


gi509241 


Homo sapiens 


Human mRNA for upstream binding 
factor (hUBF). 


652 


45 


1649 


gi55116 


Mus musculus 


transcription factor UBF 


649 


45 


1649 


gi28971 


Homo sapiens 


H.sapiens mRNA for autoantigen 
NOR-90. 


638 


60 


1650 


AAB45436 


Homo sapiens 


Human secreted protein BLAST 
search protein SEQ ID NO: 1 10. 


75 


28 


1650 


gil4210816 


Tupaia 


tl28 


71 


37 
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Vi prn p q vi n i q 

lltsJL L/^/o V XI LAO 








1650 


AAB68057 


Homo sapiens 


Amino acid sequence of a 

rppornhinarit human pelatin 

L U^UlllL' JL1J.CIJL1 L 11L1111C111 Av4 Ulii 


72 


29 


1651 


gi7022392 


Homo sapiens 


cDNA FLJ10390 fis, clone 
NT2RM4000104, moderately 

cimilar tn 7TNC FTNGER PROTEIN 

ollllllCtl l\J Z—iXL T V^- X JL J. N V_l i_/J.V J. J- J— /XL ^ 

135. 


373 


35 


1651 


AAB92770 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11245. 


373 


35 




gl*f OO J JJ 


riomo sapiens 


iriuillclll ziiiu niigci piuiciii ZjInjt 1 jj 

rriRNA, complete cds. 


384 


40 




rri^C 1 ACS 


riomo sapiens 


(from clone DKFZp572C163); 
partial cds. 


3957 


100 

1 \J\J 






riomo bapiciib 


rfYNFA FT T19574 fk Honp 

Ux-'lNxA. ITXvJ 1ZJ / t lies, L^lUllG 

NT2RM4000996, moderately 
similar to ZINC FINGER PROTEIN 
91 


3217 


100 


1 659 




nuiliu oapiciio 


Unman nrnfpin ^pmip/nnp SFO TT) 

NO:17131. 


3217 


100 


1653 


gi4092859 


Homo sapiens 


p53 regulated PA26-T2 nuclear 
trmtein fPA26^ mRNA corrrnletp 

cds 


944 


54 


1653 


gi4092863 


Homo sapiens 


non-p53 regulated PA26-T1 nuclear 
rjrotein (PA261 mRNA comolete 
cds 


942 


56 


1653 


gi!3161393 


Xenopus laevis 


nuclear factor XPA26-T2 


907 


53 


1 654 


m'6807796 


TTomo <2anipn<3 

-L-LVJllHJ OCLL/lVllO 


mRNA- cDNA DKFZn434K0614 

Xi.JJ.VJ. XX ViVi lii J_/X\JL £-^yJ 1 rX.^W w X 1 

(from clone DKFZp434K0614); 
nartial cds 


3458 


100 


1654 




Homo ^anien^ 


HRIHFB2017 mRNA partial cds. 

X. XXXXX XI XV \J X / 111XV1 T X X»a |-/ WXX V ^--*-^- J * 


1132 


100 


1654 


AAY86185 


Homo sapiens 


Nuclear transport protein clone 
hfb20 1 7 nrotein seauence. 


1132 


100 


1655 


gil 1493485 


Homo sapiens 


PR02574 


384 


100 


1655 


AAW73401 


Homo sapiens 


Human secreted protein encoded by 

(T-pnp TsJo 5 


66 


36 


1 U J J 


cn'961 363 


\yfiiQ en 

lVJ.UO OLJ. 


HT-rpll rprpntor alnVia cTiain TOR 
alpha 


60 


34 


1 656 


ml ^^50703 


T-T rv m r\ earn' pn c 


IVTArfF-FIa mRNA comnlete cds 


2593 


99 


1656 


gil4588553 


Homo sapiens 


MAGE El gene, complete cds, 

a 1 f prn a ti vp Qnlipinor 


2593 


99 


1656 


gil3276667 


Homo sapiens 


mRNA; cDNA DKFZp761N1924 
(from clone DKFZp761N1924); 

pnnrnlpfp cc\*i 


2593 


99 


1657 


gil 1493552 


Homo sapiens 


PR01933 


661 


100 


1 658 


cri?5551 8^ 

J J J 1 O J 


IXclLlUb 

norveeicus 


^1PA-1 liVp nrntpin n1 994 

O JT xi. JL IJ-lvt/ LJlVJLV^lll \JX^,Z'^ 


926 

x^ v 


44 

1 1 


1658 


gi3970966 


Homo sapiens 


PAC clone RP5-1 140G11 from 
14q24.3, complete sequence. 


689 


50 


1658 


gi4151328 


Homo sapiens 


high-risk human papilloma viruses 
E6 oncoproteins targeted protein 
E6TP1 alpha mRNA, complete cds. 


689 


50 


1659 


gi2 17346 


Drosophila 
melanogaster 


prospero 


75 


29 


1659 


gil62792 


Bos taurus 


alpha-s 1 -casein precursor 


83 


28 
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1659 


gi92 


Bos taut us 


alpna-o 1 -casein 




9ft 


1660 


gil732073 


TT ' _ 

Homo sapiens 


Human 1 Br-associatea tactor 
(hTAFII130) mRNA, partial cds. 


1 jO 


00 

77 


1 /- /TV 

1660 


gi205832o 


Homo sapiens 


ri, sapiens mKJNA lor 1 ArillJD. 






1660 


AAR56494 


Homo sapiens 


1 A 1 A-Dinumg protern-associatea 
factor hTAFII 130. 


J J J 1 


07 


1661 




rlomo sapiens 


Human isjrueppei-type zinc imger 
protein (ZNF169) gene, final exon, 
pamai cas. 




Q7 


1661 


gi487783 


Homo sapiens 


Human zinc finger protein ZNF133. 


1373 


49 


1661 


gi9392657 


Homo sapiens 


PR-domain containing protein 9 
(rKJL>iViy ) rnKJNA, complete cas. 


1369 


51 


1662 


gil3507379 


Homo sapiens 


BBS2 (BBS2) mRNA, complete cds. 


3715 


100 


1662 


gil4042451 


TT * 

Homo sapiens 


_ T-»X T A T7T T1 /(Tin -CZ~ _1 _„ 

cDNA FLJ 14729 lis, clone 
N IzKFJUUloyo. 


Joo4 


QO 


1662 


AAB93128 


Homo sapiens 


Human protein sequence SEQ ID 

\rrv ,i ini c 

JNO:1201 j. 


3684 


99 


1663 


AAY73489 


Homo sapiens 


Human secreted protein clone 
yk84_l protein sequence SEQ ID 
NO:200. 


165 


55 


1663 


gi2674262 


murine 
adenovirus 1 


33K protein 


72 


45 


1663 


gil 542885 


Homo sapiens 


H. sapiens r lvB4 gene, allele M. 


1 o 


4o 


1664 


AAY45305 


TT ' „ 

Homo sapiens 


Human secreted protein tragment 
encode a irom gene o. 




1UU 


1664 


AAW10687 


Homo sapiens 


Bax omega protein C-terminal 
peptiae use a ior annuo ay 
production. 


60 


41 


l004 


giyy4 /j5U!> 


Jrseuaomonas 
aeruginosa 


piooaDie signia-/u lactor, izy^r 
subfamily 




^7 


loo J 


gl403 1 /Z 


jvlus muscuius 


Cnv 1 

oaX- 1 




7^ 


100 J 


gl /Z4o /y I 


ivius muscuius 


noineoDOx proieux 


1 1 90 

JL X Z- W 


i ~j 


1 DO J 


gl /Z4o /y4 


ivius muscuius 


nomeoaomam protein 


1 1 90 




1666 


gil2053153 


Homo sapiens 


mRNA; cDNA DKFZp434M 1526 
(rrom clone iJJt ; vrz,p4J4ivii jzoj. 


5773 


100 


1666 


gil3097273 


Homo sapiens 


clone IMAGE:3447324, mRNA, 
partial cds. 


1774 


100 


1666 


gil49l94l7 


Mus muscuius 


Similar to KIKJiJN cDJNA 
4yiUDUoUUl gene 


1 977 


78 
/ o 


1668 


gi7582288 


Homo sapiens 


BM-008 


571 


98 


1668 


gi2992630 


Mus muscuius 


mPRL-2 


5/1 


no 


1668 


gi894l59 


Homo sapiens 


Human protein-tyrosine phosphatase 
(HU-PP-1) mRNA, partial sequence. 


571 


98 


1669 


gil752736 


Saccharomyces 
cerevisiae 


gene required for phosphoylation of 
oligosaccharides/ has high 
homology with YJR061w 


185 


41 


1 ooy 


AAdJj4Uo 


rlomo sapiens 


riuman u / k^kjz / gene protein. 




^0 
~j \j 


1669 


gil3539605 


Paramecium 
tetraurelia 


cyclophilin-RNA interacting protein 


170 


35 


1670 


gi7022912 


Homo sapiens 


cDNA FLJ 107 13 fis, clone 
NT2RP3000980. 


493 


48 


1670 


AAB93089 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11933. 


493 


48 


1670 


gil 0434247 


Homo sapiens 


cDNA FLJ12636 fis, clone 
NT2RM4001905. 


168 


43 
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1671 


gil 1643582 


Homo sapiens 


PR-domain containing protein 14 
(PRDM14) mRNA, complete cds. 


3166 


100 


1 fn\ 

10/1 


rri 1 flAIACllA 
gll U*+J*+U / O 


jriomo sapiens 


vv-Lvl > n. 17 _L< J 1/.J JJ lib, L/iuxxe 

NT2RM4000202, weakly similar to 
ZINC FINGER PROTEIN MOK-2. 


3166 

— ' X \J\J 


100 


10/1 




nomo sapiens 


nUIUall piOLClll 0C4UCJ.IL/C OXJ/V^ 11_> 

NO:14326. 


31 66 

— ' L \J\J 


100 


1 o 11 


gl /DzoZ /U 


L/ucumis sativus 


poiy^/v^-uinomg pioiem 




40 


1672 


gi4808585 


Homo sapiens 


KH type splicing regulatory protein 

^ IVIJL 0 Ivl J gCHC 5 CAUll z. aim pal Liai 

cds. 


55 


55 


1 0 /Z 


gljZOO^tO^) 


nomo Sapiens 


iiLivi > iui v^iiiinaci l\*> ii aiioL/i ijj u ui 

collagen type 1 alpha 1 and platelet 
derived growth factor beta, 202 bp. 


53 


56 


1673 


gil4329727 


Secale cereale 


high molecular weight glutenin 

cnVvnTii'i" "\7 
bU.UU.lllL y 


90 


26 


1673 


gi6684164 


Triticum 

aCSUVUIil 


glutenin, high molecular weight 

auuuuj.1 Lj|jc y jjivv/Uioui 


89 


28 


1 fcTTX 
lO/j 


a A r^rno^i 


xiomo Sapiens 


T-TiimQ"n Cf a r»T , < =, t< :i H Y'wrs't&iYi S"Pt^) TD 

XXUlllall SCt/lCLCU. piULClll, OJ3V^ AJL/ 

NO- 7142 


44 


42 


1674 


gil0435111 


Homo sapiens 


cDNA FLJ13196 fis, clone 
NT?"RP^ 004428 wealclv similar to 
CHROMODOMAIN HELICASE- 
DNA-BINDING PROTEIN 4 


750 


38 


1674 


AAB94512 


Homo sapiens 


Human protein sequence SEQ ID 
NO* 15225 


750 


38 


1674 


gi7582284 


Homo sapiens 


BM-006 


646 


44 


1675 


gil4091950 


Mus musculus 


L-threonine 3 -dehydrogenase 


333 


66 




gllUO'fU^Z'f 


l nermopiasma 
acidophilum 


uur-giucobc *+-epiiTierasc reiaieu. 
protein 


1 74 


44 


10/3 


gl l<4oZ^tOO / 


i nermopiasma 
volcanium 


iNui^-sugdr epimerasc 


1 74 

1 / *T 




1 £*7£ 
10/0 


rri 1 7£^1££^ 

gl 1 zOjjOOj 


nomo sapiens 


oione ivivjv^.z jyn iivi/\vjtjc/.j)J)'+uo»j 
mRNA, complete cds. 




1 00 


1 VJ / KJ 


rril A9007S 

gl 1 OZWU / D 


! d.1 dlllC Ul Ulil 

bursaria 

V^lllWlClia VIJLLIO x 




69 


23 


1676 


AAG00829 


Homo sapiens 


Human secreted protein, SEQ ID 
NO- 491 0 


67 


36 


1677 


gil0434750 


Homo sapiens 


cDNA FLJ 12964 fis s clone 
NT2RP2005732. 


2816 


100 


10// 




riomo sapiens 


iriuiiian proLCin sequeiicc oxj/V<j ij--' 
NO: 14949. 




1 00 

1 \J\J 


1677 


gil0433565 


Homo sapiens 


cDNA FLJ12154 fis, clone 
MAMMA1000468. 


2810 


99 


lO/O 


gil 17621o4 


Arabidopsis 

Lllctllcllicl 


AlogU/ /DU 






1678 


gi5458043 


Pyrococcus 
abyssi 


POLYRIBONUCLEOTIDE 
NUCLEOTIDYLTRANSFERASE 
related protein 


373 


34 


1678 


gi2621767 

* 


Methanothermob 
acter 

thermautotrophic 
us 


conserved protein 


349 


33 


1679 


gil0435659 


Homo sapiens 


cDNA FLJ 13605 fis, clone 


793 


100 
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PT Af^F 1 01 O^fi? 






1679 


AAB95488 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18023. 


793 


100 


1679 


gi7020399 


Homo sapiens 


cDNA FLJ20356 fis, clone 

TTTTT) 1 COO 1 

tier LdoZL. 


112 


46 


1680 


gi488555 


Homo sapiens 


Human zinc finger protein ZNF135 
rnKJNA, complete cos. 


1254 


57 


1680 


gi8453103 


Homo sapiens 


zinc finger protein rnRNA, complete 
cds. 


1194 


52 


1680 


gil5012179 


Homo sapiens 


zinc finger protein 16 (KOX 9), 
clone jvlvjo. l j 143 
IMAGE:3949487, mRNA, complete 

COS. 


1194 


52 


1 /l O 1 

1681 


giijyy 1364 

• 


rlomo sapiens 


iviiN vj linger protem i\jur.r iiii\j.n/\, 
complete cds. 




J o 


1681 


gil043 


Homo sapiens 


cuiNA. JrJ-J zi /oO us, cione 
T-TPPnn^?^ 


ZOl 


J o 


i z;oi 
lool 




riomo sapiens 


T-Tuman at-\r\r\tr\t\r > ■inlril'i'i't'rn* trrotpin ^ 
jrlLlHla.Il dpuptUllL/ liUllUllAJl |JHJLClli u 

(AIP-6). 


281 


38 


1682 


AAY48602 


Homo sapiens 


Human breast tumour-associated 

piULClIl \JD . 


60 


36 


1682 


gi2392814 


Mus musculus 


PFTAIRE kinase 


54 


28 


looZ 


a a vinnc7 
AAY jUUo / 


riomo sapiens 


J\ IlUlllall UyL/1111-U.CpcllU.Cl.ll IvIIIcIlsC 

designated hPFTAIRE. 


S4 




1 £Q1 
lOOD 


gllo3Z40 


r> os t auras 


type 11 cytoKerdLin x\ ^iiu.o j 


41 6 




1 /CO 1 


gl4o 1 


r>os taurus 


cytoKeraim o \~> i\j x\jt\j 


41 6 


^ o 


1683 


gil2653737 


Homo sapiens 


keratin 8, clone MGC:1711 
IMAGE:3349233 ? mRNA, complete 
cas. 


456 


65 


1684 


gil 1494213 


Homo sapiens 


Mix.l homeobox-like protein 

^IVllyVl^^ IIltviNr\, ^UIIipieLC CLlo. 


1221 


100 


1 AQA 

10o4 


*n /tq o a i 

giizoyoo^o 


riomo sapiens 


nomeoaomain protein iviiyv ^iviiyv^ 
gene, complete cds. 


1 791 


1 00 


loo4 


gl4Do D joU 


jvlus museums 


iviix-iiKe nomeoDox protein 


/ o*+ 


6Q 


1685 


gil4249890 


Homo sapiens 


clone MGC: 15766 
liviAvjxi.jjui /uz, mi\JN/\, complete 
cds. 


1442 


100 


1685 


gil 043 8763 


Homo sapiens 


cDNA: FU22393 fis, clone 

TTT3 pA7 QQA 


1110 


74 


1685 


gi3287375 


Mus musculus 


C9 


689 


56 


1686 


gil0803417 


Homo sapiens 


mKNA lor Cxrr i4-ielatea protein 
(GPP34R gene). 


1 Z44 


i nn 
1 uu 


1 686 


AAB36607 


Homo sapiens 


Human rLhAn l -zy protem 
sequence SEQ ID NO:29. 


1 

1 Z44 


1 nn 


i s- o /r 

1686 


gil5082415 


Homo sapiens 


golgi phosphoprotein 3, clone 
MGC:20187 IMAGE:4558305, 


Oil 


o / 


1687 


gil0803417 


Homo sapiens 


mRNA for GPP34-related protein 
(GPP34R gene). 


1459 


100 


1687 


AAB36607 


Homo sapiens 


Human FLEXHT-29 protein 
sequence SEQ ID NO:29. 


1459 


100 


1687 


gi7022870 


Homo sapiens 


cDNA FLJ10687 fis, clone 
NT2RP3000312. 


1026 


100 


1688 


gil4804 


Bacteriophage 
phi-80 


cl gene (AA 1 -236) 


1236 


100 
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bpecies 
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c 


irfpnf itv 


1688 


gi476017 


Pectobacterium 
cai oio voruni 


DNA-binding protein 


697 


59 


1688 


gi286176 


Pseudomonas 

» 


negative regulator of pyocin genes 


433 


40 


1689 


gi4704279 


Schizosaccharom 
yucb punioc 


set domain protein; transcriptional 

c 1 1 fino 
SllCllO 


561 


46 


1689 


gi5923931 


Homo sapiens 


MLL2 protein mRNA, partial cds. 


392 


53 


1689 


gi5 123787 


Homo sapiens 


mRNA for trithorax homologue 2. 


392 


53 


loyu 




riomo sapiens 


xiuman npocoion ^i^-r^j protein, 
SEQ ID NO: 201. 






1690 


AAB90570 


Homo sapiens 


Human secreted protein, SEQ ID 

XTO« 1 HQ 
INU. IUo. 


568 


98 


1690 


AAG03582 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7663. 


387 


97 


1691 


gi557822 


Saccharomyces 

• • 

cerevisiae 


mal5, stal, len: 1367, CAI: 0.3, 

A A TVU "VC7 A CT T>f\Q £.Af\ 

AJVlYxl YrSAbl rKjQOQyj 

GLUCO AMYLASE SI (EC 3.2.1.3) 


229 


24 


1691 


gl 13 04 Jo7 


Saccharomyces 
cerevisiae var. 
diastaticus 


glucoamylase 


zzy 


Zh- 


1691 


gi545660 


Saccharomyces 

• * 

cerevisiae, 
YNN295, 
reptiae, iovz aa 


Hkrlp 


203 


23 


1 A"QO 

ioyz 


m* 1 0/1 111 £J1 
gllU4J / / O / 


nomo sapiens 


cuiN J\. JrJLJZiozo us, cione 
COL08076. 


1 fRR 


JO 


1692 


gi8099348 


Homo sapiens 


zinc finger protein (ZFP) mRNA, 
complete cas. 


987 


64 


i oyz 


gl4oojO~> 


riomo sapiens 


numan zinc linger proxem z^in rijj 
mRNA, complete cds. 




Oj 


1 

1 Oyj 


rri 1 7^ 1 1 H07 
gll J>0 1 1UU / 


riomo sapiens 


r^v/^tf* ' Vina CP' XTVPi CD1 ^ mT?>JA 

piotem Kinase in i lj-^dst i d iilivj.>i/\, 
complete cds. 


RA^ 


1 on 


1 


giioyuDz iu 


ivius mus cuius 


oimiiar to protein Kinase in i u-or i«> 






1 07J 


glOoUoZH- 1 


riomo sapiens 


IllJXl N /\ , CJ_^lN/\ J_^JSJL i Z/p i f jHill /ZrU 

(from clone DKFZp434H1720); 


\j i j 


1 \J\J 


1694 


gi786117 


Ensis minor 


nuclear protein 


242 


34 




glJOZ llJi 


nomo sapiens 


iiiiviN/\ ior ivLN/x Dincimg proLcm, 
partial cds, clone: R86. 


94? 




1694 


gi5821153 


Homo sapiens 


mRNA for RNA binding protein, 
complete cas. 


247 


29 


1695 


gi7619884 


Gallus gallus 


muscle derived protein 


976 


52 


1 /cnc 

1695 


AAW49042 


Homo sapiens 


Human low density lipoprotein 
binding protein LBP-3. 




jy 


1 /T r\ c 

1695 


gi7 023731 


TT * 

Homo sapiens 


cDNA FLJ1 1209 lis, clone 
PLACE 1007946. 


oOl 


^y 


loyo 


gllZcOJZ 13 


nomo sapiens 


riuosomai protem -L»io, cione 
MGC:8373 IMAGE:28201 19, 
mRNA, complete cds. 






1696 


gi337493 


Homo sapiens 


ribosomal protein LI 8 (RPL18) 
mRNA, complete cds. 


309 


69 


1696 


gi206724 


Rattus 
norvegicus 


ribosomal protein LI 8 


292 


60 


1697 


gil0439510 


Homo sapiens 


cDNA: FLJ22965 Fis, clone 
KAT10418. 


1157 


100 
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1 6Q7 
i \jy i 


A AO01 784 


T-Tninn ^aniens 

AAW111W OCLLJAL/llO 


Human secreted protein SEO ID 
NO: 5865. 


471 


100 


1697 


gi2772565 


Homo sapiens 


Human PAC clone RP3-404F18 
frnm *X7i93 cnmnlete seauence 

AX Will /LUA>J , vVJlllU Ivlv JuuuvUUVi 


392 


100 


1698 


AAB54170 


Homo sapiens 


Human pancreatic cancer antigen 

rvrntpin Qpnnpnpp StPO I F) T s JO'ti22 

JJAWLClll 3C/L-| Llt/llL/t/ k3A-/V^ ILy JL > v_/ . \JJ-f*-> > 


1225 


98 


1698 


g i 14042471 


Homo sapiens 


cDNA FLJ14737 Fis, clone 
TMT9RP^009971 wealclv similar to 

IN JL^JXl JVV/^ / —J j WL/OJVAV O A A A AAA CLA IU 

SCD6 PROTEIN 


544 


50 


1698 


AAY53005 


Homo sapiens 


Human secreted protein clone 
pm749_8 protein sequence SEQ ID 
NO- 16 


544 


50 


1699 


gi2253297 


Monodelphis 

Hnmpstica 


kinesin homolog 


565 


80 


1699 


gil572665 


Dictyostelium 

rllCPniHpTITTl 
UloUUlUt/ LI 111 


kinesin-like protein K6 


461 


46 




g,IZ< \J J o w w 


±VA Llo 111UOV UlUJ 


Icinesin-like protein 2beta 


443 


42 


1 700 


cri 14049^07 


Hnmn sanipns 

HUlllv oapitillo 


cDNA FLJ 14644 fis clone 
NT2RP2001756, weakly similar to 
ZINC FINGER PROTEIN 84. 


1265 


98 


1 700 

J. / WW 


AAB95176 

AA1J7J A / W 


Hrnnn saniens 


Human protein seauence SEO ID 
NO:17237. 


1265 


98 


1700 


gil5081398 


Homo sapiens 


kruppel-like zinc finger protein 
TZNF300) rnRNA, complete cds. 


270 


44 


1701 


gil3182773 


Homo sapiens 


CDA10 mRNA, complete cds. 


864 


77 


1 701 
1 /Ul 


rri 1 900^997 


AAWlIilJ odjJlCllo 


PDA 01 R m"RNA rnmnlete cds 

Vl/rVU 1 O AAAAVl>.Z».j wVJlllUlv WV*/ wUO • 


864 


77 


1701 


gil3876944 


Homo sapiens 


NEFA-interacting nuclear protein 
"NTTP^O rNTTP^O'i mRNA comnlete 

J. AA — ' W llxll Jul 111AV1 > /ij LUllXL/lvLV^ 

cds. 


864 


77 


1 709 
1 /UZ 


cti1 90^08^ 


IaUIIaU odpiCIlo 


mRNA- rDNA DKF7r»434A1319 

llUVlXXAj ±\ /V L/1V1 £jJJ*TJTTV1 J 17 

(from clone DKFZp434A13 19); 
rnmnlete cds 


1816 

JL VJ* JL VJr 


100 

JV. v \J 


1702 


gi292836 


Homo sapiens 


Human trichohyalin (TRHY) gene, 
cnmnlete cds 


92 


22 


1 70? 

1 / wz. 


A A V^07Q^ 

Ail A JU/7J 


flomn cjarnpriq 

1AW1A1W oclkJAv'AiO 


A human trichohvalin TTRHY^ 
protein 


92 


22 


1703 


AAB30829 


Homo sapiens 


Arnino acid seauence of human 
signal transduction protein SGT5-1. 


1288 


99 


1703 


AAB30830 


Homo s aniens 


Amino acid sequence of human 
signal transduction protein SGT5-2. 


1288 


99 


1703 


AAB30831 


Homo sapiens 


Amino acid sequence of human 
signal transduction protein SGT5-3. 


1288 


99 


1704 


ei200964 


Mus musculus 

ITAliL? Ji Jv JLV* <J V*' L*-- 1 - v * w 


serine 2 ultra hish sulfur protein 


312 


50 


1704 


gi3228237 


Homo sapiens 


UHS KerB gene. 


278 


50 


1704 


ai200962 


Mus musculus 

1 V JL LI O 111UOV141ULI 


serine 1 ultra hi eh sulfur protein 


274 


47 


1705 


gi4416181 


Mus musculus 


ES18 


2267 


88 


1705 


gi4416183 


Homo sapiens 


ESI 8 mRNA, partial cds. 


1109 


98 


1705 


AAB38555 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 34 SEQ ID NO:92. 


721 


90 


1706 


gil88882 


Homo sapiens 


Human tracheo-bronchial mucin 
(MUC4) mRNA, partial cds. 


2181 


74 


1706 


g i454154 


Homo sapiens 


intestinal mucin (MUC2) mRNA, 
complete cds. 


1490 


29 


1706 


gil4973269 

* 


Streptococcus 
pneumoniae 


cell wall surface anchor family 
protein 


1283 


23 
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JLfcaCI 1JJ LIUil 


o 

score 


identity 


1707 


ei 12667440 




"MTT? 1 mI?TsJA pnmnl <=»ff=» rrlc 
INllxi IlAAvJLN,r\, OUlAipiClC LUb. 




QQ 


1707 


AAW59357 


Homo sapiens 


Human retinal degeneration B 1 
polypeptide (hrdgBl). 


5069 


99 


1707 


gi9367838 


Homo sapiens 


mRNA full length insert cDNA 
clone EUROIMAGE 2004661. 


4731 


99 


1708 


gi387055 


Cricetulus 

QT*iseiiR 

&1 IClvUO 


RNA polymerase II largest subunit 


628 


40 


1708 


gi825713 


Homo sapiens 


H. sapiens gene for RNA pol II 
largest subunit, exon 1 . 


631 


38 


1708 


gi36124 


Homo sapiens 


H. sapiens mRNA for RNA 1 631 
polymerase II largest subunit. 1 


38 


1709 


gi26 18752 


Takifugu 
rubripes 


zinc finger protein 


1 1149 


67 


1709 




T-Tfrmr* ccinipno 

rnjiiiu bapicilb 


Human DNA binding protein 
(HPF2) mRNA, complete cds. 


605 


in 


1709 


gl^O'Tj A / A 


X 1U1UU iaJJACAAo 


zinc finger protein mRNA, complete 
cds. 


605 


1 Q 
DO 


1710 


gi26 18752 


Takifugu 


zinc finger protein 


1496 


66 


1710 


gil0434850 


Homo sapiens 


cDNA FLJ13029 fis, clone 
NT2RP3001057, moderately similar 
to ZINC FINGER PROTEIN 91. 


707 


40 


1710 


AAB95278 


Homo sapiens 


Human protein sequence SEQ ID 
NO:17486. 


707 


40 


1711 


m7^281 75 


J. 1 wlllvJ oapiCIlb 


mRNA; cDNA DKFZp762H157 
(from clone DKFZp762H157); 
complete cds. 


2746 


1 OA 

1UU 


1711 


AAB53356 


Homo sfmienQ 


TT 1 j • * 

Human colon cancer antigen protein 
sequence SEQ ID NO:896. 


2746 


1 nn 

1UU j 


1711 


gi31283 


Homo sapiens 


TT T» "V T AO • 

Human mRNA for ezrrn. 


2743 


99 


1712 


ei7328175 


11U111U OClJJlCllCJ 


mRNA; cDNA DKFZp762H157 
(from clone DKFZp762H 1 57); 
complete cds. J 


2404 


QQ 


1712 


AAB53356 


T-ToTTIO CQT>7PT1 Q 


riuman coion cancer antigen protein 
sequence SEQ ID NO:896. 


Z4U4 


yo 


1712 


ei31283 

& A.^ 1 J 


TToinn canipnQ 

AAVJAAAU OdJJiCilO 


Human mRNA for ezrin. | 


2401 


QQ 

yo 


1713 


gi7328175 


Homo sapiens 


mRNA; cDNA DKFZp762H157 
(from clone DKFZ P 762H1 57); 
complete cds. [ 


2589 


99 


1713 


AAB53356 


Homo sapiens 


Human colon cancer antigen protein j 
sequence SEQ ID NO:896. 


2589 


99 


1713 


gi31283 


Homo sapiens 


Human mRNA for ezrin. ! 


2586 


99 


1714 

A / A *T 




nomo sapiens 


sentrin/SUMO-specific protease 
(SENP7) mRNA, complete cds. 


1253 


100 


1714 


gil3276699 


Homo sapiens 


mRNA; cDNA DKFZp5 64G 1 8 1 6 
(from clone DKFZp564Gl 816); 
complete cds. j 


1170 


98 


1714 


AAB94994 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16621. 


379 


55 


1715 


gi488555 


Homo sapiens 


Human zinc finger protein ZNF 135 1 
mRNA, complete cds. 


1258 


60 


1715 


gi5441615 


Canis familiaris 


zinc finger protein | 


1255 


59 


1715 


gi 10437767 


Homo sapiens 


cDNA: FLJ2 1 628 fis, clone 
COL08076. 


1248 


65 


1716 


gil0438660 


Homo sapiens 


cDNA: FLJ22321 fis, clone 1 


1878 


100 
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1716 


gil2655057 


Homo sapiens 


clone MGC:2574 IMAGE: 3 05 1044, 
niKJNAj complete cas. 


1878 


100 


1716 


AAE03642 


Homo sapiens 


Human extracellular matrix and cell 
adhesion molecule-6 (XMAD-6). 


1878 


100 




gi4557143 


Rattus 
norvegicus 


zinc finger protein RIN ZF 


JZU 


JJ 


1717 


gil4330448 


Homo sapiens 


mKJN A ror zinc linger pro tern 
RINZF (RINZF gene). 


jZU 




1717 


gi726284 


Mus musculus 


polyomavirus late initiator promoter 
binding protein 


333 


28 


1718 


gi8272464 


Homo sapiens 


human endogenous retrovirus W 
gagCi.37 u gag (gag; gene, 
complete cds. 


318 


55 


1718 


AAB07702 


Homo sapiens 


Jrrotein encoaea oy tne enaogenetic 
fragment of HERV-W. 


jlo 




1718 


gi5726236 


multiple 

sclerosis 

associated 

retrovirus 

eiement 


gag polyprotein 






1719 


gil3623489 


Homo sapiens 


clone IMAGE:4 109498, mRNA, 
partial cas. 


1007 


99 


1719 


gi72093 1 1 


Homo sapiens 


mRNA for FLJ00005 protein, partial 

COS. 


510 


100 


1719 


gi683545 


Chironomus 
palliaivittatus 


gamma protein constant region 


85 


34 


1720 


gil 0433 129 


Homo sapiens 


cDNA FLJ11792 fis, clone 

rlJbiVirSA 1 UUO 1 Z 1 . 


849 


100 


1720 


AAB93829 


Homo sapiens 


Human protein sequence SEQ ID 

INC. 1 jO^O. 


849 


100 


1720 


gil0440076 


Homo sapiens 


cDNA: FLJ23401 fis, clone 

xlJlJr 1 oo 1j. 


259 


42 


1721 


gil0438872 


Homo sapiens 


cDNA: FLJ22471 fis, clone 
nivv^ i ujzy, 


1681 


100 


1 no 1 

1 /Z 1 


A A VC^^flQ 
AA x oOJUy 


riomo sapiens 


xiuiiian gene /u-encoticu. protciii 
fragment, SEQ ID NO:424. 


\JZs\J 


1 \J\J 


1 ni 1 

1 /Zl 




riomo sapiens 


jTiurQaii gene /u~enuo<jeti proiciii 
fragment, SEQ ID NO:425. 






1722 


gil4039857 


Homo sapiens 


testes development-related NYD- 
£>rzz rnKJNA, complete cas. 


275 


92 


1722 


gil 04443 72 


Phaseolus 
coccineus 


Kunitz trypsin inhibitor protein 


60 


38 


1 '"7 / ""» 

1722 


gi506l39 


Zea mays 


Ec metallothionein class II protein 


Do 


Z / 


1723 


gi3l680 


T T * 

Homo sapiens 


Human mRNA for alpha- 
glucocorticoid receptor (clone OB7). 


zuz / 


1 nn 


1 '791 


glo I OoZ 


riomo sapiens 


numan n xivi n i\ lot Dcta- 
glucocorticoid receptor (clone 
OB 10). 






1723 


gi22 18074 


Homo sapiens 


glucocorticoid receptor (GRL) gene, 
intron H exon 9alpha, and complete 
cds. 


2027 


100 


1724 


gil3529542 


Mus musculus 


RIKEN cDNA 473 1402F03 gene 


1525 


96 


1724 


gi3777529 


Homo sapiens 


retinoic acid receptor responder 3 
(RARRES3) mRNA, complete cds. 


69 


41 
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1 794 


A AW60797 


IxUllllJ OOL/l^/llo 


Ta carotene inducible Gene-3 (TTG3^ 
protein. 


69 


41 


1 79 S 


ail 14.16404 




clone MGC-3855 IMAGE'2905681 
mRNA complete cds. 


1336 


100 


1 79^ 




Homo ^anip/ns 


TT s aniens HAP 1 eene for AP 
endonuc lease 1. 


1336 


100 


1 725 


ail 78747 

til X / vj / ™ / 


Homo saniefis 


Human anurinic/anvrimidinic 
endonuclease fHAPlh) mRNA, 
complete cds. 


1336 


100 


1726 


gil0440181 


Homo sapiens 


cDNA: FLJ23477 fis, clone 
HSI15732. 


814 


100 


1726 


gil3383233 


Mus musculus 


bHLH factor Hes7 


102 


32 


1726 


m21 1358 


Oalliis alius 

v — 1 LX± A. LAO Cjttll U-O 


aloha- 1 collaeen tvoe IX 


66 


55 


1727 


AAG03366 


Homo sapiens 


Human secreted protein, SEQ ID 
NO* 7447 


355 


97 


1727 


eil 504095 


Mus musculus 


DNA-bindine orotein 


85 


50 


1727 


gi 16669 10 


Mus musculus 


neurogenin 2 


85 


50 


1728 


gil2583669 


Homo sapiens 


mRNA for bHLH protein DEC2, 
comDlete cds 


2509 


100 


1728 


AAB70692 


Homo sapiens 


Human DEC2a protein sequence 
SEQ ID NO:2. 


2509 


100 


1 79 8 


A AR70691 


liUlllU OClL'l^llo 


Human OEC!2b "nroteiri seauence 
SEQ ID NO: 12. 


2498 


99 


1 790 

JL 1 jL^7 


ail 1 190Q40 


T-Tr»TYir* c ot^i pn q 

xitJlliU o Cl LI 1 Clio 


S1PAND2 mRNA complete cds 


1607 


99 


1729 


gil 1345048 


Homo sapiens 


SCAN domain-containing protein 2 
(SCAND2) gene, complete cds, 

alt6*rnaHvpl v ^nlifpH 
di tt^iiiciii v v^i y o[jiivvu.i 


1607 


99 


1729 


gi 10434473 


Homo sapiens 


cDNA FLJ 12782 fis, clone 
NT2RP2001869 moderatelv similar 

± X Z«1M Z>vulUu/j lllvUVl ULul V ullllllUl 

to ZINC FINGER PROTEIN 191. 


708 


100 


1 710 


ail 0417165 




cDNA- FLT21290 fis clone 
COLO 1954. 


813 


100 


1 710 


ai94471 66 


"P Q T*Q TYl &t~* 111 TY1 

bursaria 

C^Vilnrplla vims 1 

\_^111U1 V>11C1 V 11 Ho 1 


a61 1R 

Cl\J 1 1 IV 


64 


30 


1730 


gi8895105 


Pseudomonas 

nli a a<=* T') 1 


Cro 


60 


47 


1731 


gil 0440052 


Homo sapiens 


cDNA: FLJ23384 fis, clone 
HEP 16468 


1828 


99 


1731 


gi9947369 


Pseudomonas 

apnicrinn 1 ?!! 

&L/1 Uli lllVJ Cj tl 


agmatinase 


1014 


61 


1 71 1 


ai7544014 

£±1 / -V 1 I W _/ 1 


lv_> Irl LUlil V vUi) 

coelicolor A3(2) 


at?matinase ( fra^menf) 


646 


42 


1 719 


cri 1 4575681 


svnthetic 

o y ii Lilt/ uiL/ 

construct 


plutamatf* debvdTOPeriase 

fciu Lciiiiu. lv^ Livjii y ui ugviiaov 


1453 


98 


1 719 


cxill 707 


xxUlllU OdUlvlifl 


T-TiiiTiaTi ttiRT^JA foT flutaTnatp 

llUilUUl 111LVL ill. lul glUlulUUlw 

dehydrogenase (EC 1.4.1.3., GDH). 

" _Ls . ...V — _ ' 1_ 1 


1453 


98 


1732 


gi31799 


Homo sapiens 


Human mRNA for glutamate 
dehydrogenase (Glud-1, EC 1.4.1.3). 


1453 


98 


1733 


gil3905148 


Mus musculus 


RIKEN cDNA 5430417M23 gene 


1400 


83 


1733 


gi439146 


Saccharomyces 
cerevisiae 


B-type cyclin 


96 


32 


1733 


gi433702 


Schizosaccharom 
yces pombe 


mitotic B-type cyclin 


95 


23 


1734 


gil3936547 


Homo sapiens 


formin-binding protein 17 (FBP17) 


3194 


100 
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mRNA, partial cds. 






1734 


gil0435680 


Homo sapiens 


cDNA FLJ13619 fis, clone 
PLACE 10 10926, weakly similar to 
HYPO 1 HE 1 1CAL 72.2 isJJ 
PROTEIN C12C2.05C IN 
CHROMOSOME II. 


1680 


100 


1734 


AAB94690 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15657. 


1680 


100 


1735 


gi6 164743 


Homo sapiens 


F-box protein Fbx20 (FBX20) 
mRNA, partial cds. 


1002 


100 


1735 


AAY83093 


Homo sapiens 


F-box protein FBP-25. 


1002 


100 


1735 


AAY83072 


Homo sapiens 


F-box motif of FBP-25. 


26 1 


100 


1736 


gil699293 


human, 

megakaryoblasto 
id cell line 
MOLM-1, 
chronic 

myelocytic 

11 • , * . 
leukemia patient, 

mRNA Partial 

Mutant, 916 ntj . 

TT T ' 

[xiomo sapiens 


Evi-l-Evi-1 protein {3' region, 
deletion region} 


57 


40 


1 I DO 


gilMoioy/ 


Agro bacterium 
tumefaciens 


AijrK .L 3 D 4 op 


JO 


'X'X 
DD 


1 /Jo 


gl/04Z4oo 


bus scrota 


cytocniome dd 


^/^ 

JO 




1737 


gil4530677 


Homo sapiens 


mRNA for WNT14, complete cds. 


I6ll 


98 


1 H1H i 


gizozjo / 1 


(jrailus gallus 


Wnt-14 piotein 


14U0 




1737 


gi2605811 


Homo sapiens 


Wnt-like protein Wntl4 gene, partial 
cds. 


681 


100 


1 70 O 

1738 


gil4530677 


Homo sapiens 


mRNA tor WN 114, complete cds. 


1 OA! 


97 


1738 


gi2623871 


Gallus gallus 


Wnt-14 protein 


1554 


82 


1738 


gi2605811 


Homo sapiens 


Wnt-like protein Wntl4 gene, partial 
cds. 


681 


1 00 


1739 


gil2805039 


Homo sapiens 


clone IMAGE:3453830, mRNA, 
partial cds. 


2504 


100 


1739 


gil3276673 


Homo sapiens 


mRNA; cDNA DKFZp761A132 
(from clone DKFZp761A132); 
complete cds. 


1143 


100 


1739 


gi2290392 


Strongyloses 
stercoralis 


IgG and IgE lmmunore active 
antigen recognized by sera from 

. • , •j't j i*i* • 

patients with strongyloidiasis 


85 


50 


1740 


gil0435475 


Homo sapiens 


cDNA FLJ13456 fis, clone 
PLACE1003258, weakly similar to 
EARLY EMBRYOGENESIS ZYG- 
1 1 PROTEIN. 


886 


98 


1740 


AAB94660 


Homo sapiens 


Human protein sequence SEQ ID 


886 


98 


1740 


gi2769562 


Homo sapiens 


H.sapiens mRNA for ZYG 
homologue. 


623 


30 


1741 


gi624922 


Samia cynthia 


arylphorin (partial) 


71 


23 


1742 


AAG00335 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4416. 


863 


99 


1742 


gi5668737 


Mus musculus 


UBE-lc2 


752 


53 


1742 


gi 14042700 


Homo sapiens 


cDNA FLJ14868 fis, clone 
PLACE 10023 95, weakly similar to 


448 


56 
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Mus musculus mRNA for UBE-lcl, 
TTRF-1 r9 TIRF-lc3 






1743 


gil4043850 


Homo sapiens 


Similar to splicing factor (45kD), 

nlnne MGP* 144^9 

IMAGE:4303675, mRNA, complete 


1470 


99 


1743 


AAB08622 


Homo sapiens 


Amino acid sequence of a human 
DRT1 1 1 nolvnentide 


1470 


99 


1743 


gi3746840 


Homo sapiens 


45kDa splicing factor mRNA, 

rnmnlptp cc\^ 


1460 


98 


1744 


gil4020951 


Homo sapiens 


mRNA for FLJ00029 protein, partial 

pd«; 


1293 


99 


1744 


gi7019911 


Homo sapiens 


cDNA FLJ20059 fis, clone 
COLO 1349 


979 


48 


1744 


gi7243777 


Drosophila 

mpl q r\ n era •? tpr 


Diablo 


749 


34 


1745 


AAG02849 


Homo sapiens 


Human secreted protein, SEQ ID 
NO- 6930 


301 


98 


1745 


gi495973 1 


Tetrahymena 

nvri fYvrrrri ^ 


chaperonin-containing-TCP 1 theta 

snbunit 

O 1-1 lw' lilll V 


89 


22 


1 745 


ei940781 


A/fetTi anon vn l 
kandleri 


therrnosorne crianeronin 

IXXVf'X X X XVJ* kJ \J X 1 lv ^ VUU-k/ V/X 1*11111 


104 


25 


1746 


gil0439289 


Homo sapiens 


cDNA: FLJ22774 fis, clone 
KAIA1575 


2313 


100 


1746 


AAB95753 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18665. 


666 


48 


1 746 


A AF01 9^9 


TTrvmo c ci-ni pti q 


Human crpnp 1 pnrodpd "secreted 

XXUXXluXX KjUllv X vllVUUvU O^/V/X^IV/^J. 

nrotein HMIAJ30 SEO ID NO-94 

1/1 KJ iVUl X XX Till lis w ^ L/J-V\y .1. 1 J- ' V— r • ^ I • 


1013 

X V/ X m^f 


47 


MAI 


gil0944334 


Hnmn Rfmiens 

A X U111U uuUlullu 


Hu-Clasr)in rnRNA comnlete cds 

X JL 1/1 Vw/lUuL/lll 1111V1 11 X ^ VV/XXIL/ J.W 


6812 


99 


1747 


gil0944336 


Xenopus laevis 


Claspin 


1680 


54 


1 747 




11U111U odJL/lt?lXo 


Human tTivrotromn OPR N-tenninal 
sequence. 


212 


37 


1 74R 
I / to 




TTntno Qnnipn^ 

11U111U OClL'lV-'llO 


Human T-cell recentor active beta- 

XXU.11XCIXX X vwll i vvvU Ivl UV/UVv LfwlU 

chain (V6-D-J-C) mRNA, clone 
PL5 10 

JL 1 J 9 JL \J • 


60 


39 


1749 


gil2328443 


Homo sapiens 


mRNA for PAPA-1, complete cds. 


1810 


99 


1 749 


ail 9^98441 


IVTiiq mn<5f*n1nQ 

lVXLXO XXXIXoL'UXU.o 


PAPA-1 

X iu V X 


1695 


93 


1 749 

x / « y 


AAG01201 


Hnmn ^aniens: 


Human secreted nrotein SEO ID 
NO* 5282 


623 


100 


1750 


eil 1762214 


Arabidonsis 
thaliana 


AT3e04520 


242 


41 


1750 


e.2654615 


P s eiidomon as 

X lj w LiU\JlllUlltiiJ 

aeruginosa 


L-allo-tlireonine aldolase 

* J tillV x * W1J.A1J.V l*A**Vll*M V 


242 


45 


1750 


ei4982322 


HThermoto Q"a 

X livl XXXVjF L vJ £^CX 

maritima 


L-allo-txireonine aldolase 

1 / W11V W.U V V Allll V Wl^V/ltiUV 


239 


40 


1 7S1 


<*i1 854550 


A/Thq Tnn<ipn1n <j 

XVXLXo XXIIXoL/IXXUlS 


red-1 

x vj. x 


607 


96 


1751 


gil3435627 


Mus musculus 


Similar to nucleoredoxin 


607 


96 


1751 


gi4056568 


Zea mays 


PDI-like protein 


116 


52 


1752 


gi 104404 18 


Homo sapiens 


mRNA for FLJ00044 protein, partial 
cds. 


2034 


66 


1752 


gi 10433645 


Homo sapiens 


cDNA FLJ12221 fis, clone 
MAMMA1001091. 


1086 


65 


1752 


AAB93931 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13927.' 


1086 


65 


1753 


gil3544086 


Homo sapiens 


clone MGC: 13251 


950 


100 
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Pprppnt 

identity 








T1V/T A riTn '/ln^^AI f\ ml? "NT A rnmnletp 

cds. 






1 9^ 
1 / J J 


rriAl HI 79H 
gl41Ul /ZU 


l\ / • 1 1 fvo.i 1 in f» nine 1 

ivius mus cuius 


ItrrrrtVPi nnirlrf* o-n<=»r , l"fir* "FoTmiTl T*f*1pltf*H 

lyuipiiouy te spccixiu xuiijljjlix ic/iaicu 

protein 


209 


34 


1753 


gi6708478 


Mus musculus 


formin-like protein 


209 


34 


1 / j4 


gO J j loll 


riomo sapiens 


pj rp 


\J\JO 


1 00 

1 WW 


1754 


gi6563294 


Homo sapiens 


PIL protein mRNA, complete cds. 


608 


100 


1754 


A A IT 7^71 /TO A 

AAW71684 


Homo sapiens 


Amino acid sequence ol me numan 
tumourigenesis associated protein. 


OUo 


1 no. 

1UU 


1 T C C 

1755 


gil 0799 166 


Homo sapiens 


pro tern Kinase JNjmu-Kl iruviN a, 
complete cds. 


loo4 


QQ 


1755 


gil4290030 


Human 

lmmunoaelicienc 
y virus type i 


pol protein 


Oo 


JU 


1 / JO 


gllU44UZ / j 


riomo sapiens 


CU1N/Y, rLJZjJJu lib, CIOIIC 


lJU 




1 7^£ 
1 / JO 


m'7n9 1 ^£7 


J_yrOoOpilllcl 

melanogaster 


r1 1 1 
ti i.i 


132 


19 


1 7^7 
1 / 3 / 


gl / UZ IjO/ 


i-/lObOpillIa 
lliClallUgab LCI 


r1 1 1 

Ll 1,1 


1 17 

XL/ 


19 


1758 


gil4547148 


Homo sapiens 


mRNA for EGLN2 protein. 


2183 


100 


1758 


gil3649965 


Mus musculus 


cell growth regulator Falkor 


1918 


87 


1 / JO 


rri 1 /t <A TO /I 1 

gl 14.54 /Z41 


ivius muscuius 


"ROT Ml nrntpi'n 


1 OUO 


O \J 


1759 


gil3810677 


Mus musculus 


dachshund-like protein DACH2 


737 


88 


1 /jy 


giooM jzI 


Oallus gallus 


uaenz protein 


ozz 


77 
/ / 


1 O <0 

1 /jy 


gl_> 1UZj54 


riomo sapiens 


mtviN a run lengin msen cuin/\ 
clone EUROIMAGE 381801. 




R9 


1760 


AAB74314 


Homo sapiens 


Human splicing factor 2 protein. 


722 


100 


1 O/Trv 

1760 


gl/0zz!>44 


Homo sapiens 


cujna .T-LJ iU4oZ us, clone 
NT2RP2000153, weakly similar to 

vjAivZ Jr rv^J 1 riliN . 


OlO 


QQ 


1 /o0 


A A "D 00 o/To 

AAriyzoOo 


riomo sapiens 


Human proiem sequence oriv ijj 
NO: 11452. 


OlO 


QQ 


1 o/; 1 

1 /ol 


gll3o4j4U/ 


jvius musculus 


raaiai spoKeneaa-ij proiem 


1 8A1 


Jj 


1 O/C 1 

1 /ol 


gllzUDiUiy 


Homo sapiens 


mlviN Aj CJJlNA lJJS^rZjp4J4IUJ 13 

(from clone DKFZp434I0515); 
compieie cas. 


1 7£7 
1 / o / 


£9 
oz 


1761 


gi2905895 


Sti'ongylocentrot 
us purpurams 


radial spokehead 


1703 


61 


1762 


gi2529737 


Xenopus laevis 


ER1 


979 


44 


1762 


AAY28859 


Homo sapiens 


Human mesoderm induction early 
response protein ER1 . 


965 


52 


1762 


AAY15835 


Homo sapiens 


A human erl protein. 


965 


52 


1763 


gil 043 5 949 


Homo sapiens 


cDNA FLJ13815 fis, clone 
THYRO1000381. 


2949 


99 


1763 


AAB95581 


Homo sapiens 


Human protein sequence SEQ ID 

\T/^i .1 o O A 1 

JN(J: loZ41. 


2949 


99 


1763 


gi861295 


Caenorhabditis 
elegans 


similar to S. cerevisiae HAP4 
transcriptional activator 
(SP:HAP4_YEAST, P 14064); 
similar to regulatory domain of PI3- 
kinase P85-alpha subunit 
(phosphatidylinositol 3 -kinase) and 
BCR, the product of the breakpoint 
cluster region gene 


189 


27 
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identity 


1 H CA 

1 /o4 


giDy 1 1 i/i 


Homo sapiens 


mucin 1 1 (MUOi 1 ) mKiNA, partial 
cds. 


4ol / 


QQ 


1764 


AAY59288 


Homo sapiens 


Human MUCH polypeptide. 


4817 


99 


1764 


gi 1834503 


Homo sapiens 


H.sapiens MUC5B gene (partial). 


621 


29 


1765 


gi59l 1881 


T T 

Homo sapiens 


mRNA; cDNA DKFZp434K156 
(from clone DKFZp434K156); 
partial cds. 


O O O A 

2384 


1 AA 

100 


1765 


gi5830706 


variola minor 
virus 


A42R protein 


64 


32 


1765 


gi5 16427 


Variola virus 


A37R 


03 


32 


1766 


gi7328104 


Homo sapiens 


mRNA; cDNA DKFZp761N1024 
(rrom clone DKr Zp / o 1 JN 1 024 ) ; 
partial cds. 


3570 


99 


1766 


gi 10440043 


Homo sapiens 


cDNA: r LJ 23 3 77 lis, clone 
HEP 1623 7. 


ion/) 

25 /4 


on 


1766 


g i4884314 


Homo sapiens 


mRNA; cDNA DKFZp5 6610746 
(rrom clone DKr Zp5 6 610746); 
partial cds. 


852 


89 


1767 


gil0435911 


Homo sapiens 


cDNA FLJ13787 fis, clone 
PLACE4000670. 


491 


51 


1767 


AAB94757 


TT ■ 

Homo sapiens 


Human protein sequence SEQ ID 
NO:15817. 


491 


C 1 i 

51 


1161 


gi6176338 


Homo sapiens 


ubiquitous tetratricopeptide 
containing protein RoXaN mRNA, 
partial cds. 


488 


51 


1768 


gil620100 


T> 

Paramecium 
bursaria 

Chlorella virus 1 


Pro- and Glu-nch, PENPEV (lOx); 
similar to Streptococcus B antigen, 
corresponds to Swiss-Prot Accession 
Number P2795 1 


1 12 


33 


1 /DO 


gll49 1 JOoz 


Streptococcus 
pyogenes 


fibronectin-binding protein Fba 


i i /r 
1 1 O 




I /DO 




rlomo sapiens 


variably cnargeu A-c v Lal } 
mRNA, complete cds. 


i no 




1769 


gil5147877 


Mus musculus 


Spred-2 


1170 


92 


1769 


gil5147875 


Mus musculus 


Spred-1 


475 


56 


1769 


AAB45199 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID 
NO: 140. 


251 


62 


1770 


gi442368 


Rattus 
norvegicus 


neuronal olfactomedm-related ER 
localized protein 


371 


33 


1770 


gil4250608 


Homo sapiens 


Similar to olfactomedm related ER 
localized protein, clone MGC: 1341 
IMAGE:3349741, mRNA, complete 
cds. 


371 


34 


1770 


gil5079891 


TT * 

Homo sapiens 


1 TV A7 / — t /—I A C\ 1 

clone MGC: 1 967 1 

IMAGE:3352603, mRNA, complete 

CQS. 


371 


34 


1771 


gi99306l4 


Homo sapiens 


steroid receptor RNA activator 
isoform 3 mRNA, complete cds. 


1241 


100 


1771 


gi99306l0 


Homo sapiens 


steroid receptor RNA activator 
isoform l mRNA, complete cds. 


1224 


99 


1771 


gi99306l2 


Homo sapiens 


steroid receptor RNA activator 
isoform 2 mRNA, complete cds. 


1218 


98 


1772 


gil2053l05 


Homo sapiens 


mRNA; cDNA DKFZp434Kl 1 1 
(from clone DKFZp434Kl ll); 


539 


32 
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complete cds. 






1 77? 


i \j x yy x i 


AXUlllU OdJLJlCllo 


pDNA FT T20059 fis clone 
COLO 1349. 


473 


31 


1 777 




^/fnc miiCHiliic 
-LVXUo lllUoL-UlUo 


Qftin-hinHiTio' r^TwfpiTi 

clU Llll UJLLALLlllg yjxvJXS^xxx 


466 


29 


1773 


gil 0437750 


Homo sapiens 


cDNA: FLJ21616 fis, clone 
COL07477. 


375 


100 


1 / / D 


gll J /U 


xiomo sapiens 


IIuLrULUDUlC-aSbUC-laLCU. piULClll, 

RP/EB family, member 2, clone 
MGC:1279 IMAGE: 29 89643, 

IlJUVl>.r\, L/UIlipiClC L/Uo. 


74 


40 


1773 


gil292868 


Homo sapiens 


H. sapiens mRNA for novel T-cell 

d.U 11 V d. LlUIl JJIUICJLLI. 


74 


40 


1 774 


gl 1 ZUjjU / D 


riuiiiu Sapiens 


(from clone DKFZp434N1817); 

fnnrnlptp rHs 


4044 


99 


111 A 


gi6760043 


Homo sapiens 


variable charge protein on X with 
eieht retreats fVCX-8r i mRNA 
complete cds. 


69 


33 


111 A 


eil 1934656 


Homo satuens 

X X\J XXXKJ OUUXvllO 


variablv charged X-A (VOXA^ 
mRNA, complete cds. 


69 


33 


1775 

X 1 1 mJ 


AAB26846 


Homo •sanifin 5 * 

XX\JXXX\J uUUXvllO 


Human A/TASL1 nrotein seauence 

X.X\XXXX%XXX XI—JA—/X L/lV/lvlXJL O VU MVJUlVVa 


248 


33 


1775 


gi3293318 


Caenorhabditis 


leucine-rich repeat protein SOC-2 


243 


29 


1775 


gi3252977 


Caenorhabditis 


Ras-binding protein SUR-8 


243 


29 


1776 


gi7650364 


Rattus 

norvft?i mi 

11V/1. V VfiJ.w Liu 


delta Kalirin-7 


162 


23 


1776 


gi3108193 


Rattus 


Duo 


159 


22 


1776 


gi7767545 


Rattus 

11U1 V vglVUO 


Kalirin-7c isoform 


162 


23 


1777 

XIII 


gi5802824 


Homo <?anipn<3 

xx\j xxxkj ouuiviio 


endo&enoii<? retrovirus HERV-TC1 09 
complete seauence 


841 


64 


\in 


gi5802810 


Homo sapiens 


endogenous retrovirus HERV-K101, 
comolete seauence 


827 


62 


1777 


si5802814 


Homo saniens 


endogenous retrovirus HERV-K.103 
complete sequence. 


839 

V-^ -m-S S 


63 


1778 


gil 1993915 


Homo sapiens 


HOXB9 gene, exon 2 and complete 
cds. 


1334 


100 


1778 


g i440955 


Mus SD. 


Abdominal-B ho mo 1 o g — I I o xB 9 


1285 


96 


1778 


gi4322080 


Danio rerio 


homeobox protein 


897 


70 


1779 


gil 1385660 


Homo sat)iens 


CTCL tumor antieen se57-l mRNA, 
complete cds. 


1654 


99 


1119 

A. 1 1 ~S 


ei4103950 


Sus scrofa 

UUO JvlUlU 


tuftelin 

L. L4- JL kvXXXX 


181 

X \J X 


27 

xw / 


1779 


gi3986746 


Bos taurus 


tuftelin 


184 


27 


1 780 

J. / ou 


A AR1 9406 


x x\Jxxl\J au-jJlcilb 


Ammn ar*iH ?pniiPT)rp r\~F n VinmnTi 
^viiiiiiw auiu o^/UU&ii^/w UJ. a iiLLiiidii 

secreted protein. 


683 


75 


1780 


AAB47275 


Homo sapiens 


hOAT4. 


663 


74 


1780 


AAY52386 


Homo sapiens 


Human transmembrane protein 
HP02000. 


630 


68 


1781 


gi3139158 


Homo sapiens 


LINE-1 like protein mRNA, partial 
cds. 


318 


100 


1781 


AAB44391 


Homo sapiens 


Sequence homologous to human 
secreted protein encoded by gene 8. 


153 


49 


1781 


gi5070622 


Homo sapiens 


retrotransposon LI insertion in X- 


159 


46 
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lintcea retinitis pigmentosa locus, 
complete sequence. 






1 TOO 

1 /oz 


gioo41zoo 


xlomo sapiens 




zuo 


JO 


1782 


AAB57128 


Homo sapiens 


Human prostate cancer antigen 
protein sequence oxiv id in\^. i /uo. 


202 


58 


1782 


gi7020993 


Homo sapiens 


cDNA FLJ20719 fis, clone 

unco i nc\f\A 
xixir 1 /UU4. 


202 


58 


l/Ki 


AACjrUl jUU 


xlomo sapiens 


xiuman secrete a protein, oxiv^ iu 
NO: 5581. 


DID 


00 

yy 


1783 


gi9799628 


Drosophila 
melanogaster 


Nut2 


408 


60 


1783 


gi3 879644 


Caenorhabditis 
eiegans 


T09A5.6 


282 


44 


17o4 


gll4U43z /3 


xlomo sapiens 


clone lrviA^jxi. j dud uzy , mrvrs/v, 
partial cds. 


oH 


zu 


1784 


gio503210 


Cervus nippon 


amelogenm 


OD 




1784 


AAB59012 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 
/ZU. 


84 


26 


1785 


gil0580910 


Halobacterium 
sp. JNKC-1 


Vng 1407c 


83 


48 


1785 


gi2635307 


Bacillus subtilis 


ysmA 


72 


28 


1785 


gil 5024997 


Clostridium 
acetobutyhcum 


Predicted esterase 


68 


45 


1786 


gil3539682 


Homo sapiens 


golgi-associated microtubule- 
binding protein riwL/lvJ mlviN A, 
complete cos. 


2320 


100 


1786 


gi3005085 


Homo sapiens 


hookl protein (HOOK1) rriRNA, 
complete cas. 


1425 


61 


1786 


gil 5079605 


Homo sapiens 


Similar to hookl protein, clone 
inRNA, complete cds. 


1425 


61 


1 /o / 


gl/UloD 1 / 


xlomo sapiens 


nirviN/V, CUINA X^JSJrZvpH- j^+1V1Uj~) 

(from clone DKFZp434M035). 


1 ^70 

i j / y 


yy 


1 /o / 


■oi i Oil 1 

glo 1 1 oZj5 1 


Homo sapiens 


1 . J | W l5 ^VT A <^AtM1^lat^ A/IP 

Jtsi / miviNA, complete cas. 


OjO 


1 on 


1787 


gi8118229 


Homo sapiens 


B17 long form mRNA, complete 
cas. 


822 


99 


1788 


gil899230 


Homo sapiens 


Human iroquois-class homeodomain 
protein IRX-2a mRNA, partial cds. 


206 


33 


1788 


gi9965418 


Mus musculus 


iroquois-class homeobox protein 
IRX2 


212 


46 


1788 


gi7576708 


Mus musculus 


iroquois-class homeobox protein 
Irxo 


212 


46 


1789 


AAG02679 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6760. 


297 


100 


1789 


gi295671 


Saccharomyces 
cerevisiae 


1 j 1 1 J7 

selected as a weak suppressor oi a 
mutant oi tne suounit /\Lv^+u or i^in/\ 
dependant RNA polymerase I and 
III 


123 


24 


1789 


gi531492 


Saccharomyces 
cerevisiae 


RHP3p 


123 


27 


1790 


gil2053347 


Homo sapiens 


mRNA; cDNA DKFZp586Ml 120 
(from clone DKFZp586Ml 120); 
complete cds. 


1154 


100 


1790 


gil2655852 


Mus musculus 


protein phosphatase- 1 regulatory 


236 


41 
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s 
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subunit 7 






1 7QA 




Jtionio sapiens 


protein pnospnaid.se- 1 icguidiury 
subunit 7 (PPP1R7) gene, exon 1 1, 

OUlIljJlClC clilU a.iiciii«-ci vciy 

spliced product. 


9^4 


4? 


1 7Q 1 


Vri 1 ^id^ SQ79 
gl 1 DHjDy 1 & 


jricjiiivj b «pi crib 


L/1UI1C AiVJl/-YVJFl_y . J *T J \J J J O , 111TVJ.N/A., 

partial cds. 


O \J 


94 


1 701 
i / y i 


glUUVJ 1 1 o 


^cd niayb 


PY+pnoin 1 1 1/" f=» nrntpin 
CALClli) lll-HlvC JJUJlCill 


All 


32 


1791 


gi8163634 


Streptococcus 

JJllC III 11U lllciC 


surface protein PspC 


396 


31 


1792 


gill63177 


Homo sapiens 


Human inducible poly(A)-binding 

piULClll lliXvl > ■£*-•) ksUlllJJlCLC/ L^Uo. 


1447 


75 


1792 


gi2801403 


Homo sapiens 


polyadenylate binding protein 
mRNA, complete cds. 


1447 


75 


1792 


gil3096978 


Mus musculus 


Similar to poly(A)-binding protein, 
cytoplasmic 4 (inducible form) 


1440 


75 


1 70^ 


rri 1 ^77/^7^1 


xioino sapiens 


inrviN/\ ior J7 1 vjj/ ana coiieu-ctui 

rl r»r"r~m in rr^n+dinin cr "1 f'F"Vf~" , f~i' 1 crf^n^i 
uuiiidiii uuiiLciiiiiiig i x y^sKj x gciicy. 


1 OOl 


yy 


1 7Q^ 

i / y d 


cri 1 04^8^69 


Jn.L»llHJ bdJL/lCllo 


rDKTA- FT T779S1 fi<; rlone 
HRC02686 


1 61 

X\J X 


36 


17Q3 
i / y d 


gi / \J^J uoo 


JL AWlllVJ oa-LJlC/llo 


rDNA FT T1 1 1 fi<i clone 
PLACE1007488, weakly similar to 
PUTATIVE RHO/RAC GUANINE 
NUCLEOTIDE EXCHANGE 
FACTOR 


164 


42 


1794 


gi4456148 


Gallus gallus 


paralernmin 


223 


44 


1 704 


oi44S6677 


IV/TllG TnilQ^lllllQ 
IVlLlo lliLloL- U.1 Lib 


nqra 1 pmmi n 


355 


39 


1794 


gi4456613 


Homo sapiens 


mRNA for paralernmin. 


345 


38 


1 70S 


glOUU 1 i o 


ZjCa nidyb 


CALCllblll-liivC piLllClll 


74S 


97 


1795 


gi9280319 


Arabidopsis 

llldlldlld 


extensin protein-like 


219 


25 


1795 


gil572721 


Homo sapiens 


Human megakaryocyte stimulating 

IdUlL/l lllTvlN ^A., OVJ111U1CIC 


218 


22 


1796 


gil0439695 


Homo sapiens 


cDNA: FLJ231 14 fis, clone 


749 


100 


1796 


gi942602 


Ancylostoma 

UrtlllllUlli 


anticoagulant protein 6 precursor 


68 


28 


1 706 

X / 7U 




X -LL/lllvJ baJJICllO 


T-Tnmsin nrri't'FM'n ?pniiPTifp SIFf") TFi 

11L1111C111 IJUJLt^lll itU Ut^liv^ 1 j V^/ 11-/ 

NO- 11240 


74 


32 


1797 


gi 14042627 


Homo sapiens 


cDNA FLJ 14825 fis, clone 
OVARC1000781. 


1978 


100 


1 707 


A ARQ4S^4 


JL -LwlllVJ o dpi Clio 


TTtitti5*ti ywc\\p*\v\ cpnnptirp ^1F(^ TT~l 
XlLUXlall LJlv/lClll oCL[UCllOC/ uivy A-Lv 

NO: 15270. 


1 Q7R 

1 / o 


1 00 

1 uu 


1797 


gi3319457 


Caenorhabditis 
elegans 


contains similarity to O-linked 
GlcNAc transferases 


94 


27 


1 7QQ 

1 /y<S 


glOlOODU / 


ivius museums 




A7S 




1 70S 
i / y o 




AHJlllU aaJJlCllo 


T?"P47 nrntpin tTiT?"\TA rmnnl pff* pnQ 

1V17 tii LJl VJ Lt/lli 111XV1N/A., Odlll|JlCLC wUo . 




43 


1798 


gil4550461 


Homo sapiens 


RP42 homolog, clone MGC: 15099 
IMAGE:3939758, mRNA, complete 
cds. 


428 


43 


1799 


gil3278936 


Homo sapiens 


Similar to RIKEN cDNA 
5430432M24 gene, clone 
MGC:4675 IMAGE:3532660, 
mRNA, complete cds. 


1527 


100 


1799 


gil710282 


Homo sapiens 


Human clone 23803 mRNA, partial 


337 


48 
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cds. 






1799 


gil9923 


Nicotiana 
tabacum 


pistil extensin like protein, partial 
CDS 


117 


31 


1801 


gil2652727 


Homo sapiens 


clone IMAGE:3352566, mRNA, 
■nartial cds 


2142 


100 


1801 


AAB95536 


Homo sapiens 


Human protein sequence SEQ ID 
NO-18138 


2116 


99 


1801 


gil020145 


Homo sapiens 


Human DNA binding protein 
(HPF2) mRNA, complete cds. 


863 


47 


1 CM') 




xiomo bapiCIlb 


int^rlpnVin pnlianrpr TiindiriP" factor 

lllLCl.lCU.iVJ.JJL ClJllClxll/Cl U»XXX\X xxi^ la^iui 

2, 45kD, clone MGC:8391 
TMAGF* 28205 05 mRNA complete 
cds. 


1050 


100 


1802 


gi532313 


Homo sapiens 


Human nuclear factor NF45 mRNA, 
complete cds. 


1050 


100 


1802 


AAB58802 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 


1050 


100 


1803 


gi 10644754 


Caenorhabditis 
elegans 


GATA-type transcription factor 


92 


32 




gioooUioy 


^aenornaDQiiis 
elegans 


olxTxlxalliy LU I ca£>L xlxlxU'gCxl 

regulatory protein GLN3 (PIR Acc. 
No. S22280), contains similarity to 
Pfam domain* PF00320 CGATA zinc 
finger), Score=20.9, E- 
va i ue= 0 0044 N=l 


92 


32 


1803 


gi5706504 


Schizosaccharom 

vrpc TiornTift 

V vLO L,'V_JXX1L>'V_< 


Highly conserved ring finger; 
similar to veast aoc 1 1 SPC 
component 


74 


35 


1804 


gil4043223 


Homo sapiens 


clone MGC: 15677 
IMAGE-3350001 mRNA, complete 
cds. 


888 


100 


1 OuH 


L l tDAU juu 


A r hTm c\ ot^qi c 

Al CiUlUAJ JL?o1l3 

thaliana 


AT4e20350/F9F13 6 


177 


36 


1 RCiA 




s\tzL U L/dC LCl l u.111 

tumefaciens 


AGR C 894d 


115 


25 


1805 


gil4043223 


Homo sapiens 


clone MGC: 15677 
IMAGE'3350001 mRNA complete 
cds. 


244 


93 




cri 1 S 1 SSzt^S 


A crroh a fa* i*i lirvi 

rVglUUdv LCI 1 Hill 

tumefaciens 


AGR C 894p 


91 


26 


1 QfK 
1 OK) J 


glj j J AO Ah 


TV /fi t c m nopn lnc 


P t^TiLliVp oTowtri far* tor rscftTitor 
RrbFi4 intracellular domain 

i j J. UL/T lli.ll iiWllvlltil \J>VliiuxAi 


65 


27 




gl J. '-rUHOZ.Z. J 


Unmn QQni p>rs c 

1XU111VJ OOJJl^llO 


clone MGC- 15677 
IMAGE:3350001, mRNA, complete 
cds. 


231 


100 


1806 


gil5155438 


Aerobacterium 
tumefaciens 


AGR_C_894p 


84 


26 


1806 


gil592321 


Methanococcus 
jannaschii 


GMP synthase (guaA) 


84 


33 


1807 


gi2407913 


Homo sapiens 


H.sapiens MLN50 mRNA. 


809 


91 


1807 


AAW25767 


Homo sapiens 


Human Lasp-1. 


809 


91 


1807 


gil3506795 


Rattus 
norvegicus 


LASP-1 


758 


85 


1808 


AAB65641 


Homo sapiens 


Novel protein kinase, SEQ ID NO: 


1259 


90 
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168. 






1808 


gi2645810 


Mus museums 


Pftan*e-l 


1 1 

i iyj 


Do 


1808 


AAY30087 


Homo sapiens 


A human cyclin-dependent kinase 
designated hPFl AlKb. 


1194 


68 


1809 


gi 1043 5 062 


T T ' 

Homo sapiens 


T\\T A T^T T1 O 1 /TO -CJ ™ -.1 — — 

cDNA FLJ13163 lis, clone 
NT2RP3003656. 


2ol)4 


nn 
yy 


1809 


A A T~4 r\ C 1 A 1 

AAB95343 


Homo sapiens 


Human protein sequence JSbQ 1JD 
NO:17625. 


zoU4 


oo 


1809 


gi 1043 6604 


Homo sapiens 


cDNA FLJ 14205 Fis, clone 
NT2RP3003155. 


2213 


100 


1810 


AAG01851 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5932. 


A A C 

445 


y<$ 


1810 


gi483843 


Bos taurus 


50 kDa protein 


88 


35 


1810 


AAB56241 


TT • 

Homo sapiens 


Human secreted protein sequence 
encoded by gene 8 SEQ ID NO:335. 


A O 

42 


5U 


1811 


gil3097573 


Homo sapiens 


Similar to thiosulfate 
sulfurtransferase (rhodanese), clone 
MGC:l0492 lMACxb:3ol 1 253, 
mRNA, complete cds. 


1598 


1U0 


1811 


gi432376 


Homo sapiens 


Human rohu mRNA for rhodanese. 


15 /5 


no 
yo 


1811 


gil741864 


T"» i t 

Rattus 

• 

norvegicus 


mercaptopyruvate sulrurtransierase 


13o5 


Q C 

o5 


1812 


AAY00294 


Homo sapiens 


Human secreted protein encoded by 
gene 37. 


240 


100 


18 12 


gi3366918 


A 1 * 1 " 

Arabidopsis 
thaliana 


"VTA T~V TTJ1 J 1_ . , ' j T~« 

NADH dehydrogenase subunit F 


76 


T 1 

31 


1812 


gi5881743 


Arabidopsis 
thaliana 


NADH dehydrogenase ND5 


76 


31 


1813 


gil262852 


Mus musculus 


M 1 7 protein 


134 


29 


1813 


gi486 


Bos taurus 


• j 11 , ■ x rTT 

epidermal keratin VII 


63 


30 


1813 


gi3287141 


Human 

lmmunodeficienc 
y virus type 1 


vpu 


67 


37 


1814 


gil3539682 


Homo sapiens 


golgi-associated microtubule- 
binding protein HOQK3 mRNA, 
complete cds. 


193 


32 


1814 


gi3005087 


T T * 

Homo sapiens 


hook2 protein (HOOK2) mRNA, 
complete cds. 


175 


33 


1814 


AAY82321 


Homo sapiens 


Human protein transport molecule 
(PTAM) SEQ ID NO:5. 


175 


33 


1815 


gil0434781 


Homo sapiens 


cDNA FLJ12985 fis, clone 
NT2RP3000050, moderately similar 
to ZINC FINGER PROTEIN 91. 


C O f\ 

582 


100 


1815 


AAB94396 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14964. 


582 


100 


1815 


gi7023216 


Homo sapiens 


cDNA FLJ 108 91 fis, clone 
NT2RP4002078, weakly similar to 
ZINC FINGER PROTEIN 91. 


483 


55 


1816 


gi915208 


Sus scrofa 


gastric mucin 


339 


29 


1816 


gi7332056 


Caenorhabditis 
elegans 


contains similarity to Pfam family 
PF00078 (Reverse transcriptase 
(RNA-dependent)), score=79.6, 
E=6.3e-20, E=l 


317 


27 


1816 


gi557822 


Saccharomyces 
cerevisiae 


mal5, stal, len: 1367, CAI: 0.3, 
AMYH YEAST P08640 


311 


29 
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s 

**** 
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GLUCO AMYLASE SI (EC 3 2 1 3") 






1817 


gi2432007 


Mus musculus 


peripherial benzodiazepine receptor 
associated protein; PBR associated 

jJlULt/Ill, X xV± jL*\J 


178 


67 


1817 


gi4490289 


Human 

ctLicuvj v jll Uo ty jjc 

41 


33K protein 


80 


33 


1817 


gil2214169 


Enterococcus 

r*l 1 T*d ^ 

1111 aC 


penicillin binding protein 1 A 


95 


28 


1818 


gil4042046 


Homo sapiens 


cDNA FLJ 14492 lis, clone 
MAMMA 1 0070^7 wealclv similar 

IVl.rAJ.V_LlVA.rA. 1 \J \J _7_J / , W tttKl V 0111U1C11 

to ZINC FINGER PROTEIN 135. 


3402 


99 


1 81 R 


A ARQS 1 O'a 


T-Trunn c a r>i pit q 


TTiiman r>rntf*in QPnnptiPP S»FO IF} 

A J. Ill Hull IJl \J tt/lll oUU U^llvv kJJ_/V<J \.J—r 

NO: 17076. 


3402 


99 


1818 


gil0435504 


Homo sapiens 


cDNA FLJ 13479 frs, clone 
PLACF 1003738 weaklv similar to 
ZINC FINGER PROTEIN 135. 


2739 


99 


1 81 Q 

1017 


A A 001 QOR 


Hntnn qhtm'pti <; 

11U111U OdL/lV/llO 


TTnman secreted rvrntein TT") 

_L 1 LlillClll Sb^lt/LvU. IJi. w L^l 1 1, UJL/V^ ±.t. S 

NO: 5989. 


307 


100 




cn1 S09-SQ41 


1 r\ q Ixi rl m tn 

V/iUO LI 1U.1 Hill 

acetobutylicum 


■""i-alanine-D-alanine lipase 

jl/ aiciiiiiiv^ x«/ uicuiiiiv xiguow 


49 


24 


1 81 0 


o-i 1474701 4 


O VOL JLF11 V 1\J U \s L/ llo 

aureus subsp. 

aureus A/fii"S0 


RNase HIT 

iviNciac mi 


52 


25 


1820 


gil3278819 


Homo sapiens 


clone MGC:2776 IMAGE:2959536, 
mRNA comtilete cds 

X X -Li. VX i J. Jk. ^ vVllljJlvLv Vx' V_i. C 1 • 


639 


100 


1820 


gil3278906 


Homo sapiens 


clone MGC:4440 IMAGE:2959536, 
mRNA comtilete cds 


639 


100 


1820 


AAG01158 


Homo sapiens 


Human secreted protein, SEQ ID 
NO* 5239 


502 


100 


1821 


AAB67454 


Homo sapiens 


Amino acid sequence of a human 
chaperone polypeptide. 


1097 


99 


1 891 

J. o/. 1 


giU 1 / 7UUO 


yces pombe 


LillclJ U1ULC1JL1 


mJ +J mJ 


39 


1821 


gi9294116 


Arabidopsis 

tVia 1 1 a nfi 

LLICLI IdllcX 


dnaJ protein-like 


346 


40 


1822 


gi3860093 


Homo sapiens 


MDC-3.13 isoform 2 mRNA, 

prvrrvnlpte cds 


655 


60 


1822 


gil3529164 


Homo sapiens 


TNF-induced protein, clone 
MGC- 12451 TMAGF-3997650 
mRNA comnlete cds 


655 


60 


1822 


gil3937826 


Homo sapiens 


TNF-induced protein, clone 
MGC: 12346 IMAGE:3930240, 
mRNA comnlete cds 


655 


60 


1823 


gil 65780 


Oryctolagus 

mmiPiiliiQ 

1/ LUllls HI U.O 


ubiquitin conjugating-protein 


379 


100 


1823 


gi207555 


Rattus 
norvegicus 


ubiquitin conjugating-protein 


379 


100 


1823 


gil237240 


Mus musculus 


ubiquitin-conjugating enzym 


379 


100 


1824 


gil769491 


Homo sapiens 


Human kruppel-related zinc finger 
protein (ZNF184) mRNA, partial 
cds. 


691 


33 


1824 


gi868160 


Rattus 
norvegicus 


Cys2/His2 zinc finger protein 


567 


37 


1824 


gil3752754 


Homo sapiens 


zinc finger 1111 mRNA, complete 


545 


37 
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c± c r» r* i t* f 1 f\ n 
XlcsLI ipilUll 


c 

score 


X CI tCH I 

identity 








cr\ c 

OUb. 






1 R9S 

1 OZ J 


m974Q771 

g,!^. / *Tjr 111. 


IVlUo IIlU.bC> UlUb 


topp_ 1 


1 1 99 

1 1 Z,^w 


79 


1825 


AAB63598 


Homo sapiens 


Human gastric cancer associated 

diiiigcn proieiri bcquciit>c onv^ i-l* 

NO:960. 


135 


22 


1 R9S 


0"i1 67ft^S 
gil \J I OjJ 


i f* i~\ tc\ o *f ii ii m 
LyUo LC11UI11 

discoideum 


inyubiii iicctvy dicij.ii 


1 


74 


1 896 


cril 04171 SI 

gl 1 \JHO / I D 1 


XXUIIUJ bdpicllb 


rTYM A • VT T9 1 1 90 fk rlrmp 

{sLJlr*/\. StLjJAL LZ.\J Ills, L/IUIIC 

CAS05691. 


9S45 


QQ 


1826 


gi531204 


Hydra vulgaris 


myosin heavy chain 


107 


21 


1 QOA 
loZO 


glDDLoO 1 


i rypanosoma 
brucei 


iz protein 


1 o^ 

1Uj 


1 s 
1 o 


loZ / 


AArSoOOZJ 


riomo sapiens 


Human ri^rjJvrii-^o protein 
sequence oxiv^ iu inu.^j. 


Q1 & 

y io 


1 nn 


1827 


gi4929683 


Homo sapiens 


CGI- 107 protein mRNA, complete 
cds. 


183 


25 


1827 


gi7023866 


Homo sapiens 


cDNA FLJ1 1295 fis, clone 
Jrij/Yv^ri i uuy / z i , weaKiy similar to 
MSF1 PROTEIN. 


183 


25 


1 QOQ 

1 o/o 


gli JZ>4:5 1 


ivius mus cuius 


oimiiar to KiivriiN cuin/\ 
1300007L22 gene 


/OU 


^9 
OZ 


1 QOC 
1 SZo 


glO'+Z) JZ 


A.enopus laevis 


apical piotem 




'X'X 


1828 


gil773381 


Homo sapiens 


chromosome X clone U177G4, 

UlDZxlD, UlOolJjj 1 /^•/\0, Ul/ZUO, 

and U186B3 from Xp22, complete 

b c q uciiLrC . 


305 


38 


1829 


AAG01378 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5459. 


575 


100 


lozy 


gl 1 4Zj U4ZO 


riomo sapiens 


Clone llVl/VOxi. jOOOzjO, mrvlN/\, 

partial cds. 


1 OQ 
iuy 


9^ 

ZJ 


i szy 


gn uho /oy 1 


riomo sapiens 


curN/\. Jri^jzi^ou us, cione 

POT 0S0^4 


OQ 

yy 


9Q 

zy 


1 OZ)U 


gll J / JZ / JH- 


riunio bdpiciib 


7inp ■fiTifn^T' 1111 ml?'\T A rnnTnlAtp 

Ziiiiu liiigci liii iiirvrN/\, L/UinpicLc 

cds. 


867 


-JO 


1 ojU 


giiu^f jo / oy 


riomo sapiens 


pfilSjA ThT T14^zl^ -fie nlnnp 

ci_^iN/\. rLJi^tj'tj ns, cione 
THYRO 100 1189, weakly similar to 
ZINC FINGER PROTEIN 91. 


R69 
ooz 


DO 


1 a^n 


A ARQSR/^9 

A/\D7JOOZ 


nomo sapiens 


numan protein sequence ojj*v^ iu 


R69 
ooz 


DO 


1 879 




riomo sapiens 


iNcuron-associatea protein. 


1 ^61 

1 JU JL 


QS 


1832 


AAW29648 


Homo sapiens 


Human secreted protein BI1641. 


905 


100 


1 5 JZ 


giyozzz i y 


XV a ITU s 

norvegicus 


Deia-catenm omamg protem 


zz>y 




1 


crizl79SAA 
glHOZD il f*+ 


i ncnoprusia ni, 
larvae, Peptide, 
256 aa 


riourjy — zo Kaa trypsin protease 
homolog 


7zL 


41 
t -t i 


1 U J J 


ei26 18994 


JLJdLtlllUo J UU Llllo 


YoxB 1 


71 


37 


1833 


gi2634246 


Bacillus subtilis 


yoxB 


71 


37 


1834 


gi2394174 


Homo sapiens 


zinc finger protein ZNF191 
(2MF191) gene, complete cds. 


818 


56 


1834 


gi4405797 


Homo sapiens 


retinoic acid suppression protein A 
(RSG-A) mRNA, complete cds. 


818 


56 


1834 


gil3097726 


Homo sapiens 


zinc finger protein 24 (KOX 17), 
clone MGC:2057 IMAGE:3537160, 
mRNA, complete cds. 


818 


56 
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irlpnf itv 


1835 


gil3623571 


Homo sapiens 


clone IMAGE:4309033, mRNA, 
pamai cos. 


1639 


99 


1835 


gil 1967437 


Caenorhabditis 
eiegans 


suppressor of presenilin 2 


81 


35 


1835 


gi3425860 


Streptomyces 
coeiicoior 


ScbA protein 


102 


28 


1836 


gil3623571 


Homo sapiens 


clone IMAGE:4309033, mRNA, 
partial cds. 


705 


99 


1836 




bscnericnia coli 
0157:H7 


tail iiber protein 


QQ 

yy 


j / 


1 OQ/C 

loio 


gil lyo /4j5 / 


L/ aenoma d aitis 
elegans 


suppressor oi preseniiin z 


O 1 




1 Q11 

loi / 


' 1 r\A 1 '70*7 'J 

gllU43 /y /-5 


riomo sapiens 


cuiNA. rLJZioUi us, cione 
HEP00707. 


too 


77 


1 Q11 

lo3 / 


giiZLDzyoj 


riomo sapiens 


-ml? AT A • rOTlXTA TYteT^7r^ £ \f : \f\~\A^^A{\ 
irUvlN/V, CJJINA JJivi Z^p J DOlvl 1 u 

(from clone DKFZp566M1046); 
complete cos. 


1 oo 




1837 


gil0439143 


Homo sapiens 


cDNA: FLJ22665 fis, clone 
xioiucsziy. 


188 


26 


1838 


gil 3 940223 


Homo sapiens 


partial mRNA for doublesex-mab-3 
(JJJVLJ domain (ajivlkiaz genej. 


504 


46 


1838 


gi9743439 


Oreochromis 
niloticus 


sex-determining protein DMO 


408 


43 


1838 


gi6466208 


Mus musculus 


doublesex and mab-3 related 
transcription iactor i 


330 


59 


1839 


gil0433552 


Homo sapiens 


cDNA FLJ12145 fis, clone 

lVl/\lVlrvl/\ 1 UUU Dz/D . 


2946 


99 


1839 


AAB93901 


Homo sapiens 


Human protein sequence SEQ ID 


2946 


99 


1839 


gil 2804489 


Homo sapiens 


clone MGQ2771 IMAGE:2958853, 
mK-iN/\, complete cas. 


1797 


77 


1840 


gil 2804489 


Homo sapiens 


clone MGC:2771 IMAGE:2958853, 
mi\.iN/\, complete ccis. 


3096 


99 


1840 


gil 3 647045 


Mus musculus 


Sip homologue lacking C2 domains- 

a 


1845 


63 


1840 


gil0433552 


Homo sapiens 


cDNA FLJ12145 fis, clone 

lvl/\iVXlVJLrv 1 \J\J\JD y o . 


1791 


77 


1841 


gil2804489 


Homo sapiens 


clone MGC:2771 IMAGE:2958853, 
mi\j.N/\, complete cos. 


1167 


98 


1841 


gil0433552 


Homo sapiens 


cDNA FLJ12145 fis, clone 

Aif A A/TA/T A 1 nnfllO^ 


1771 


94 


1841 


AAB93901 


Homo sapiens 


Human protein sequence SEQ ID 

INvJ. Ijojj, 


1771 


94 


1842 


gil2804489 


Homo sapiens 


clone MGC:2771 IMAGE:2958853, 
mKJNA, complete cas. 


1788 


100 


1842 


gil0433552 


Homo sapiens 


cDNA FLJ12145 fis, clone 
MAMMA 1 OOO^OS 


2090 


99 


1842 


AAB93901 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13855. 


2090 


99 


1843 


gil 0434777 


Homo sapiens 


cDNA FLJ 12983 fis, clone 
NT2RP3000002. 


556 


100 


1843 


AAB94395 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14962. 


556 


100 


1843 


gi2621988 


Methanothermob 
acter 


integrase-recombinase protein 


77 


31 
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thermautotrophic 
us 








1844 


gil0435272 


Homo sapiens 


cDNA FLJ13305 fis, clone 
kj v AJvL.- 1 uu i jV y . 


386 


35 


1 O A A 

1844 


A A DO/1 C/C/l 


Homo sapiens 


Human proiem sequence on^ xu 
NO:15341. 




3S 


1844 


gi9916 


Plasmodium 
falciparum 


liver stage antigen 


162 


21 


1845 


gil 1994722 


Arabidopsis 

4-1, « M n ~* ~ 

tnaliana 


contains similarity to 
caimoauiin~gene la.r jin j.iu 


151 


24 


1845 


gil4625425 


Nicotiana 
tabacum 


calmodulin NtCaM13 


141 


27 


1845 


gi535428 


Pisum sativum 


calmodulin-like protein 


137 


24 


1846 


gi70228ll 


T T „ * „ 

Homo sapiens 


cUJNA rLJ lUo4y lis, clone 
NT2RP2005835, weakly similar to 

o-tli 1 ri\U 1 -C/irN . 


HUD 




1846 


* An 0^2 AO 1 


Homo sapiens 


riuman proiem sequence or,^ ij_>» 




39 


1846 


gi2285790 


Rattus 
norvegicus 


p47 


386 


44 


1 O A *7 

184 / 


„: 1 (\a o COO 

gllU43 odIZ 


nomo sapiens 


CUiN/\ rJLJ IjH^u lib, dune 

PLACE1004118. 


731 7 


99 


1847 


AAB94676 


Homo sapiens 


Human protein sequence SEQ ID 

XTfV 1 ^^OA 


2317 


99 


1847 


AAR70784 


Homo sapiens 


Saposin-C. 


77 


30 


1848 


AAY59712 


Homo sapiens 


oecretea proiem o304-.L-.Dy-r l^i . 




1 on 

1 uu 


1848 


gi8979275 


Chlamydophila 
pneumoniae 

J lio 


muramoylalanine-glutamate ligase 


84 


26 


1848 


gi7189877 


Chlamydophila 

• 

pneumoniae 
AR39 


UDP-N-acetylmuramoylalanine— D- 
glutamate ligase 


84 


26 


i o A r\ 

1849 


gi9800509 


Homo sapiens 


pmcn-z protein ixiivi>/\, complete 
cds. 


K\ 1 


1 00 

1 VJVJ 


10/10 

1849 


AACj02299 


Homo sapiens 


xiuman secretea proiem, i.l' 
NO: 6380. 




-70 


1849 


gil3542b44 


Mus musculus 


KJ-lvrUN cjjjna ^yziDz^Auz gene 


LOO 




1850 


gil 043 5 023 


Homo sapiens 


cDNA FLJ13140 fis, clone 
xtto r? p ^ o nzi 


2219 


85 


1850 


AAB 94491 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15 180. 


2219 


85 


1850 


AAY73377 


Homo sapiens 


rllKiVL clone lo4Dy4l protem 
sequence. 


ZZIj 


1 no ; 

1 uu 


1851 


gil 425 0241 


Mus musculus 


bimilar to prostate tumor over 
expressed gene 1 


Z 1 D 1 


y\j 


1851 


gi7920398 


Homo sapiens 


PlOVl (PIOV1) gene, complete 


9 1 


OS 


1851 


AAB58992 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 
700. 


1861 


97 


1852 


gil4588846 


Homo sapiens 


titin zinc-finger anchoring protein, 
50kDa isoform. 


1954 


96 


1852 


gil4588848 


Homo sapiens 


titin zinc-finger anchoring protein, 
60kDa isoform. 


1954 


96 


1852 


gi!4043532 


Homo sapiens 


clone MGC: 12836 


1876 


96 
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IMAGE:41 10783, mRNA, complete 






1853 


gi7022832 


Homo sapiens 


cDNA FLJ10661 fis, clone 
"NTT9RP90061 06 


451 


69 


1853 


AAB93042 


Homo sapiens 


Human protein sequence SEQ ID 

Tsjn-1 1 897 


451 


69 


1853 


gi 14603 247 


Homo sapiens 


Similar to RIKEN cDNA 

J / Ov/T-Uj/vjri j gciic, wiuiic 

MGC: 19636 IMAGE:2822323, 


254 


74 


1854 


gi3387911 


Homo sapiens 


clone 24408 2-oxoglutarate carrier 

nrntpiti mT? 7\T A r*r\~m"r\1 (^tf* pHc 

jJl U IC111 UllVlN xTt, ^/UJUJLUICLG LUS. 


1186 


100 


1854 


gil3676350 


Homo sapiens 


clone MGC:2449 IMAGE:2960933, 

ml?\r A r* m ~r\ 1 F^tf* pHc 
llJUtViM S\, L^UIIIJJICLC CUt). 


1186 


100 


1854 


gi 13676368 


Homo sapiens 


clone MGC:4128 IMAGE:2960933, 
mRNA, complete cds. 


1186 


100 




rr-i 1 ftA'XQ^Ci^ 

gi i u^jyouo 


nomo sapiens 


cuiN/\. rLJ/jUHZ us, cione 
LNG02323. 


o / o 




1855 


gi4699964 


Homo sapiens 


PAC clone RP5-953A4 from 

/ q 1 1 .z d-q£ l.i, complete sequence. 


107 


24 


1 ODD 


1 99Ann9s 

gl 1 Z.A c \\j\JAo 


iviyxo coccus 


translation miiiaiion iacior irz 




96 






JLJ.UJ.ilU iCtUlClli) 


(from clone DKFZp434N0615); 
■narfial cc\^ 

L/Ul UUl vUOi 


9976 


1 00 


1856 


AAB56293 


Homo sapiens 


Human secreted protein sequence 
encoded bv gene 99 SEO ID 
NO:387. 


715 


67 


1856 


gi2230871 


Homo sapiens 


H. sapiens mRNA for Miz-1 protein. 


502 


30 


J.OJ / 


ml 99^9^99 


JLJ.U1A1U odUieilo 


InT TP*F^ A mT?7SJA r*A7TTn1r a tf a ' r*Hc 

1J.U1 X JA 11JJLV1 > A, WWiIllJiei.C WU-O. 


9480 


100 

JL W W 


1857 


gil2620406 


Homo sapiens 


UPF3 (UPF3) mRNA, complete cds. 


2357 


100 


1 R^7 

lOD / 


A A V39 1 QA 


nomo sapiens 


nuuiaii lecepiui inuiecuie ^ivjc^j 
encoded by Incyte clone 266775. 


1 AIR 


1 no 

1 WW 




gll.ZoU.3H- 1.3 


xiomo sapiens 


cione iviov_/,z'+ iivi/\ojD. j ituj /z, 
mRNA, complete cds. 


Ol 1 


1 DO 

1 Uw 




AAB58965 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence oiiv^ lu 
673. 


500 


96 




gil4330030 


Mus musculus 


bM401L17.7.1 (novel protein 
^lsoiorm i )) 


379 


72 


1858 


AAT05577_ 
aal 


Homo sapiens 


Human p69 cDNA. 


544 


50 


1858 


gil4250411 


Homo sapiens 


islet cell autoantigen 1 (69kD), 
clone MGC:9360 IMAGE:3 854722, 
mKJNA, complete cds. 


544 


50 


1858 


gil674386 


Homo sapiens 


Human clone IS 10 diabetes mellitus 

lypc 1 aUlUctllllgCll ^lv^r\pDi/ J UllvlN r\ 7 

complete cds. 


544 


50 


1859 


gi 10434744 


Homo sapiens 


cDNA FLJ 12960 fis, clone 
NT2RP2005605, weakly similar to 
QUEUINE TRNA- 
RIBOSYLTRANSFERASE (EC 
2.4.2.29). 


2182 


100 


1859 


AAB94387 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14945. 


2182 


100 
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c 

score 


X CI 

identity 


1859 


gi6977808 


Bacillus subtilis 


tKJNA-guamne transgiycosyiase 


1 77 

Lit 


97 


1860 


gil0436039 


Homo sapiens 


cDNA FLJ13881 fis, clone 
THYRO1001458, moderately 
similar to MYUolJN HbAV Y 
CHAIN, NONMUSCLE TYPE B. 


2624 


99 


1860 


AAB95605 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18299. 


2624 


99 


1860 


gil2653781 


XT ' 

Homo sapiens 


clone IMACjE:334935z, mKJNA, 
partial cds. 


zzoo 




1861 


gil2053347 


Homo sapiens 


mRNA; cDNA DKFZp586Ml 120 
(from clone DKFZp586Ml 120); 
complete cds. 


1005 


97 


1861 


gi4633066 


Homo sapiens 


protein phosphatase- 1 regulatory 
suounit 7 (RPFlR/j gene, exon 11, 
complete cds and alternatively 
spliced product. 


234 


42 


1861 


gi4633068 


Homo sapiens 


protein phosphatase- 1 regulatory 
subunit / (FJrrlK/j gene, exons o 
and 9 and alternatively spliced 
products. 


234 


42 


1862 


gi7022518 


Homo sapiens 


cDNA FLJ10466 fis, clone 
JN 1 zKJr 1 UU 1 00 D . 


488 


38 


1862 


AAB92852 


Homo sapiens 


Human protein sequence SEQ ID 

XT/™\. 1 1 A 1 H 

JNU: 1 141 /. 


488 


38 


1862 


gil0439049 


Homo sapiens 


cDNA: FLJ22601 fis, clone 

TTOTA/1 A H 1 

HMU44 / 1 . 


858 


99 


1863 


AAB70366 


Homo sapiens 


Human serine protease protein 
sequence o±iQ 1JJ JNU. 3. 


1365 


99 


1863 


gi6137097 


Homo sapiens 


serine protease DESC1 (DESC1) 
ixlkjna, complete cas. 


660 


48 


1863 


A A \,T\n A 1 A 

AAY99414 


Homo sapiens 


rluman fkaji^oi ^uini^/^zj ammo 
acid sequence SEQ ID NO:269. 




'-rO 


1864 


A A r'AOOO^ 

AACj02826 


Homo sapiens 


rluman se ere tea protein, o.ca^ iiy 
NO: 6907. 




QO 


1 O £1 A 

1864 


■ -« o**71 Dim 

gil271o793 


Yarrowia 
lipolytica 


iNjji protein 






1864 


AAB569D5 


Homo sapiens 


rluman prostate cancer antigen 
protein sequence SEQ ID NO: 1533. 




29 


1 Cl s~ c 

1865 


gi45 12103 


Bos taurus 


rao 1 1 Dinaing protein 


1 1 77 
11// 


R4 


1865 


gi6049150 


Rattus 
norvegicus 


WD-containing protein 


1027 


78 


1865 


gi7715587 


Streptococcus 

• 

pneumoniae 


PspA 


136 


26 


1866 


gil0439164 


Homo sapiens 


cDNA: FLJ22679 fis, clone 

UCT1 A/C0^7 

HS11068 /. 


1365 


100 


1866 


gi59578 


Human 
nerpesvirus h 


6.5kd unidentified reading frame (aa 

1 &CW 
1 -OU ) 


62 


34 


1866 


gil334920 


Human 
herpesvirus 4 


BNLF2a reading frame 


62 


34 


1867 


gil2652919 


Homo sapiens 


clone MGC:2803 IMAGE:2961319, 
mRNA, complete cds. 


927 


100 


1867 


AAY59763 


Homo sapiens 


Human normal ovarian tissue 
derived protein 40. 


877 


100 


1867 


AAB92974 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11683. 


551 


100 
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TTCf TT\ 
Jtlll 11^ 


opecies 


r>f»cor infirm 


« 

O V/ V/ ft \s 


V*prt*prit 
identity 


1868 


AAG03616 


Homo sapiens 


Human secreted protein, SEQ ID 
no- 7^07 


429 


96 


1868 


gil 1093536 


Homo sapiens 


interleukin 2 1 (IL21) mRNA, 


75 


25 


1868 


AAB 18623 


Homo sapiens 


A human zalphal 1 ligand 

puiypcp iiiic 


75 


25 


1869 


gi2 114473 


Mus musculus 


pl40mDia 


488 


29 




A A Win 
AAW /O /j54 


riomo sapiens 


xiuman miJia rvno Targeting protein. 








gizy4/zJo 


riomo sapiens 


Qiapnanous 1 ^nuju\i ) rinvi>/\, 
cunipieic uLit>* 






1870 


gil2804143 


Homo sapiens 


clone IMAGE:3953631, mRNA, 
paniai cas. 


2777 


99 


1870 


gi4432833 


Arabidopsis 
thaliana 


En/Spm-like transposon protein 


110 


29 


1870 


gil 2804045 


Homo sapiens 


clone IMAGE:3940843, mRNA, 
partial cds. 


116 


28 


18 /l 


gll.3oU.3o4j 


ivLus museums 


riDonuciease/angiogenin muiDiior z 


1 ^ 1 ^ 

1 Z> 1 o 


AQ 


1871 


gil 1096305 


Homo sapiens 


NALP2 mRNA, complete cds. 


802 


33 


1871 


gi7020664 


Homo sapiens 


cDNA FLJ20510 fis, clone 
KAT09662. 


802 


33 


1872 


gi7023341 


T T 

Homo sapiens 


r\\T A T'T T 1 AH/TO J~ _ _ 1 

cDNA FLJ 10968 lis, clone 
PLACE1000863, moderately similar 
to PUTATIVE MITOCHONDRIAL 

4Uo KibUbUMAL r'KCJ 1 rilJN 
YHR148W. 




ol 


1872 


gil5079901 


Homo sapiens 


Similar to mitochondrial ribosomal 
protem o4, clone MLrC lyoco 
IMAGE:3344519, mRNA, complete 
cds. 


536 


61 


1 O ^7 O 

lo /z 


gllZlOiJZl 


Homo sapiens 


rnKJNA, CJJJN/Y JJJsJrZvpDoOl^Ul 15 

(from clone DKFZp586L01 18); 
complete cds. 




o 1 


1873 


AAY73467 


Homo sapiens 


Human secreted protem clone 
yd61_l protein sequence SEQ ID 


AQA 
4^4 




1 QTJ 


A A "D /I </1 ^< 


riomo sapiens 


numan secreiea proiem jdju/\o i 
search protein SEQ ID NO: 150. 


D DO 


07 


1873 


gil3905260 


Mus musculus 


RIKEN cDNA 1300006C06 gene 


246 


50 


1 o /4 


1 Af\A A r\c\ 1 

gl!4U44UUl 


Homo sapiens 


clone m.ljLx.141 id 
IMAGE:4120858, mRNA, complete 
cds. 


zy i j 


QQ 
yy 


1874 


gil0121792 


Homo sapiens 


MJ0495-like protein SelB mRNA, 
partial cas. 


2704 


99 


1874 


gil0121790 


Mus musculus 


MJ0495-like protein SelB 


2662 


87 


1 one 


gi 10434992 


Homo sapiens 


-•pvXT A T7T T1 Q 1 lO -fir. y->1/^-»->A 

cDJNA r L>J I j 1 iy lis, clone 
NT2RP3002671, weakly similar to 
FT ONGATION FACTOR 2 


AAAC\ 


QQ 

yy 


1875 


AAB94484 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15 164. 


4440 


99 


1875 


AAB07853 


Homo sapiens 


Amino acid sequence of Smadl 
interactor protein clone SI +2 8. 


2574 


99 


1876 


gil2621066 


Homo sapiens 


RNA helicase-DEAD box protein 
RH1 16 iriRNA, complete cds. 


5309 


100 


1876 


gil 1344594 


Homo sapiens 


melanoma differentiation associated 
protein-5 (MDA5) mRNA, complete 


5285 


99 
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in i l iLf 


species 


uescnption 


c 

score 


Jrercent 
lucniiiy 








cds. 






Y O / O 


en 1 CiAH^Q^ 


riomo sapiens 


cdin/v ri^J i. 1-5j4 us, clone 
HEMBA 1000 129, weakly similar to 

C8A4.08C IN CHROMOSOME I. 


Oo4 


4U 


1877 


gi4929721 


Homo sapiens 


CGI- 126 protein mRNA, complete 
cab. 


613 


100 


1877 


gi6841532 


Homo sapiens 


HSPC155 


613 


100 


1877 


AAW74783 


Homo sapiens 


Human secreted protein encoded by 
gene D4 cione xiivi/\jjjuz. 


613 


100 


1878 


gi2230873 


Homo sapiens 


H. sapiens mRNA for M phase 
phosphoprotein 10. 


3398 


99 


1 C7C 




xiomo sapiens 


Human secreted protein sequence 
encoded by gene 46 SEQ ID 

XTTV 1 CIA 


2096 


100 


1878 


AAB63275 


Homo sapiens 


Human breast cancer associated 
antigen protein sequence SEQ ID 
NO:637. 


804 


98 


1 C70 


gllZUO J to 3 


Homo sapiens 


mRNA; cDNA DKFZp434G1415 
^rrom clone JJJsJ H Z < p434Cjl4i d), 
complete cds. 


1 134 


•A f\f\ 

100 




gi /uzuoiy 


no mo sapiens 


cuiNA rL,jZKjwD lis, clone 
KAT07835. 


O/l 1 

Z41 




1879 


gi2622648 


M e thanothermob 
acier 

thermautotrophic 

tic 
lib 


conserved protein 


103 


29 


1880 


gi506502 


Mus musculus 


NK10 


276 


68 


1 8 SO 
1 oou 


gl J. U't'tUUO 1 


xiomo sapiens 


cuina. ri^jzj4U4 lis, clone 
HEP 18862. 




/ZA 

o4 


1 OO v 


gl / UZZjZj 


nomo sapiens 


r»FlXTA T7T T1 0/l£Q fZc ^l^r^o 

ciJiN/\ x*jlj iu4oy ns, clone 
NT2RP2000008, weakly similar to 


ZJO 


T 1 
/ I 


1881 


gi 104403 71 


Homo sapiens 


mRNA for FLJ00020 protein, partial 


5999 


99 


1881 


gil 5077006 


Homo sapiens 


AT-hook protein AKNA mRNA, 
complete cus. 


2454 


79 


J. O O 1 




xiomo sapiens 


numan gene o j-encoaea secretea 
protein fragment, SEQ ID NO:285. 


y44 




1882 


gil0433398 


Homo sapiens 


cDNA FLJ12015 fis, clone 
HEMBB 100 1695. 


778 


100 


1 

1 ooz 


A AT3Q<Jf\1 Q 

l\J\r>yDU io 


Homo sapiens 


Human protein sequence SEQ ID 

1NU. IO /ZO. 


778 


100 


1882 


gi6562363 


Canis familiaris 


brain-specific synapse associated 
protein, ±3 as so on 


71 


40 


1883 


gil 1527289 


Homo sapiens 


LHX5 protein mRNA, complete cds. 


2174 


100 


1883 


ei53 1220 


i-VCl.il Ho 

norvegicus 


Q rninn apirl "pic» nfii • V« -f^-i /"I "m din 
CLlLllllKJ ClUlCl IC/dLtllC llUIXldJLlVJiilCtlll, 

bp 895 .. 1074; amino acid feature: 
LIM1, bp 373 516; amino acid 
feature: LIM2, bp 550 705 


9 1 


OR 


1883 


gil388183 


Mus musculus 


LIM/homeodomain 


2150 


98 


1884 


AAG01575 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5656. 


341 


82 


1884 


gi!4714594 


Homo sapiens 


clone MGC: 15955 
IMAGE:3538218, mRNA, complete 


56 


32 
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ID 


±111 lil 


species 


uescripuon 


c 

score 


x crceiii 
identity 








cds. 






1 OQA 

loo4 


'1 Q/tAI A/Z 

gll 34U140 


Homo sapiens 


Human gene tor apoierntm exons 
d ana 4. 


jj 


J4 


1885 


gil 1602755 


Mus musculus 


zinc finger protein 


1227 


70 


1 ooc 

1ooj> 


gll Id jo4oz 


Homo sapiens 


rnJKJN A tor rs-cell 
lymphoma/leukaemia 11A extra 
long iorm (oL^i-i ia-al genej. 




4U 


1885 


gill558488 


Homo sapiens 


mRNA for B-cell 

lympnoma/ieuKaemia 1 id ^.dcli i±5 
gene). 


298 


46 


looo 


glO /UJ70 / 


ivius museums 


pnospnonpase v^-i^z 


970 




1886 


gil 183 844 


Rattus 
norvegicus 


130kDa-Ins(l,4,5)P3 binding protein 


266 


40 


1 oou 


gl /oU 1ZZ 


riomo sapiens 


riuman i ilk in ja lot pnospnonpase 
complete cds. 


9^4 


zlO 


1 RR7 
loo/ 


gl I VJH J U D D U 




CL/lN.r\. I/J-jJ IHIt-O llo, dUIlC 

1VT A A/T A/f A 1 007047 


71 fx 

/ID 


1 00 


1887 


AAB95718 


Homo sapiens 


Human protein sequence SEQ ID 

1NU. 1 OJOZ. 


716 


100 


1887 


gi9928335 


Mycobacterium 
tuberculosis 


SEQ ID N_ 5A 


68 


33 


1888 


gil542813 


Mus musculus 


Six5 


2908 


85 


looo 


giozJUolo 


Homo sapiens 


SIX4 gene, exon 3 and complete 
cds. 


oUo 


^9 


1888 


gil255626 


Mus musculus 


AREC3 


798 


46 


1889 


gil 0242353 


Homo sapiens 


pellino 2 (PELI2) mRNA, complete 
cds. 


1636 


70 


1889 


gil4550457 


Homo sapiens 


pellino (Drosophila) homolog 2, 
clone MGC: 15066 
lMACjb:3942712, mKJNA, complete 
cds. 


1636 


70 


1 oon 

i ooy 


gl 11/2423 j j 


Mus musculus 


pellino 1 


i 1 
loJ 1 


/O 


1890 


gil2052983 


Homo sapiens 


mRNA; cDNA DKFZp434I1610 
(from clone DKFZp434I1610); 
complete cds. 


4179 


100 


1890 


gi6467206 


Homo sapiens 


GIOT-4 mRNA for gonadotropin 
inducible transcription repressor-4, 
complete cas. 


2071 


57 


1890 


AAB94388 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14947. 


1976 


54 


1 on i 

isy 1 


gil20529oi 


Homo sapiens 


mRNA; cDNA DKI 4 Zp434I1610 
(from clone DKFZp434I1610); 
complete cds. 






1891 


gi5262560 


Homo sapiens 


mRNA; cDNA DKFZp572P0920 
(from clone DKFZp572P0920); 
partial cds. 


460 


56 


1 891 


gl 1 VJHOH-OJU 


noinu bapiCIlb 


pDMA FT T1 ^0^9 ftc n1rkn<=» 

NT2RP3001 120, moderately similar 
to ZINC FINGER PROTEIN 136. 






1892 


gil 0439 174 


Homo sapiens 


cDNA: FLJ22686 fis, clone 
HSI10987. 


2500 


99 


1892 


gi6224683 


Homo sapiens 


unconventional myosin- 15 mRNA, 
complete cds. 


759 


34 


1892 


gi6224685 


Mus musculus 


unconventional myosin- 15 


741 


33 


1893 


gi6453502 


Homo sapiens 


mRNA; cDNA DKFZp434B1917 


1639 


99 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


Percent 
identity 








^li uiii t/iuiic jL^iVirzjp i T«7*Ti-> x y i / j 3 

partial cds. 






1 QQ1 
1 OSO 


gllH-H-OOZ / / 


iionio sapiens 


aai-i misj.N/\ ior/-\_f\i-i aipua, 
complete cds. 


4S6 


1 00 

X \J\J 


1 QQ1 


gii^^f ooz /y 


xiomo sapiens 


a at 1 r-nPTsJA fr»r A AT 1 K/^tn 

complete cds. 


1 


1 00 




rri/^/l^^ ^fi7 


xiomo sapiens 


(from clone DKFZp434B1917); 

pd.1 llal Utlo. 


i \jjy 


yy 


1894 


gil4488277 


Homo sapiens 


aat-1 niRNA for AAT-1 alpha, 

r*f\mY\ 1 one 
CUIIipiCLC l/Llo. 


486 


100 


1894 


gi 1448 8279 


Homo sapiens 


aat-1 mRNA for AAT-1 beta, 
compieic tub. 


152 


100 


1895 


gi9963996 


Danio rerio 


eukaryotic translation initiation 
iacioi eii/Hn-i 


791 


75 


1895 


gi306487 


Homo sapiens 


cap-binding protein niRNA, 
complete cqs. 


778 


66 


1895 


AAY78505 


Homo sapiens 


Human general translation initiation 

IdUlUX C1JT*t1-» ailllilU dUlU. bCCJUClxL-C. 


778 


66 


1896 


gil0433134 


Homo sapiens 


cDNA FLJ1 1795 fis, clone 

XjH-zIVIID/t. 1 \J\J UIJJ. 


1549 


100 


1896 


AAB93831 


Homo sapiens 


Human protein sequence SEQ ID 

1 > KJ . X JUJv. 


1549 


100 


1896 


AAB53278 


Homo sapiens 


Human colon cancer antigen protein 


70 


40 


1 SQ7 
X 0¥ 1 


oil 04^1 1A 
gl 1 U*40 3 1 ^'t 




rF>"NTA FT T1 1705 fis rlnne 
\sUL^t£\. i IjJ 11/ y \j lib, uiuiic 

HEMBA 1006 155. 


1 558 


100 

X \J\J 


1 RQ7 




jTLorno bapiciib 


XlUiilClll jJlULClll bCLJ^UCllL/C OXiy 1-L>' 

NO-1 ^6S0 
i> V-/. xj\j^\jt 


1 558 

i J JO 


1 00 


1897 


AAB53278 


Homo sapiens 


Human colon cancer antigen protein 

<ipnnenrp SFO TF) NOS1 S 


70 


40 


1898 


gi9963806 


Homo sapiens 


zinc finger protein ZNF287 

^7TSJF9S7^ m"R~M A rnmnlpfp cc\<i 

yZ-iLy J_ \ZL O / J llliXl^^V, ^^JlilLflC'LC V/LLo. 


794 


59 


1 SQS 


ail 1 ^?7S4.Q 


TV/Tnc mncpnliic 
IVlUb lliU.oCU.lUlo 


r 7fnc "fiTiaf^f nTn'ffiTi ^"PTAT"? 

Zilllt llilgCl LJlULCiii OivrV 1 Z, 


776 


60 


1 SOS 




nuniu ocuJICilb 


v i ti r* fin apt* nrr»f<=»in ( 7 l-*'"P^ mT?7sJA 

ZdllL/ llilgC/1 ^JiULClll ^^i- JT ^ 1111.V1 N A, 


787 


44 


1 ROQ 
i Qjy 


aii ^nooi s 


T-Trnrin QfimpnQ 

I XUlllvJ ouUltllo 


mRTsTA For TR ^V)pta ponrnlptp rri*? 


1778 


100 


1 soo 


ail SI ^SS? 


T—T<~iTnr\ earn fMn c 

X1UX1XU OdLJXC'XXO 


Human NAK 1 mRNA for DNA 
HinHiriP nrntpin romnlete cds 

L/liiVllll& Ulv/Lvlll^ vvlllUivlv v^iu* 


1585 


100 


1899 


gi292834 


Homo sapiens 


Human TR3 orphan receptor 

rnRlSTA nomnlete cds 


1577 


99 


1900 


gi2276396 


Homo sapiens 


Human C2f mRNA, complete cds. 


1235 


100 


1900 




IV/Tnc miionnlnc 

IVlUo IXXlXOOLXXLlO 


C2F 


1136 


88 


1900 


gil2805095 


Mus musculus 


gene rich cluster, C2f gene 


1136 


88 


1 901 
i y\j i 


ai9?7^^06 

giZ,Z. / 0J70 


XiUillU oapiCllo 


l-Tnmnn ^~"?f* mT?T\JA fnmnlptp pHq 

llUilidli \^A,L liUvl>^V, ^U11XJJ1C/LC WVJO. 


797 


87 


1901 


gi2289907 


Mus musculus 


C2F 


722 


78 


1901 


ail 2805095 


A/Tns mnsculns 

XVXUO lllUoV* UXUO 


crpne rich cluster CU2f pene 


122 


78 


1902 


gi2696611 


Rattus 
norvegicus 


RNA splicing-related protein 


981 


84 


1902 


gi9837385 


Takifugu 
rubripes 


retinitis pigmentosa GTPase 
regulator-like protein 


181 


25 


1902 


AAB43954 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1399. 


150 


25 


1903 


gi4191594 


Homo sapiens 


protein serine/threonine phosphatase 
4 regulatory subunit 1 (PP4R1) 


503 


65 
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Liescnption 


c 

score 


identity 








mKJNA, complete cas. 






1903 


AAB73355 


Homo sapiens 


Human mesangial cell meg- 1 
protein. 


503 


65 


1903 


gi5824546 


Caenorhabditis 
elegans 


M04C9.6c 


238 


43 




AAWjyo /4 


riomo sapiens 


/vmino acio sequence oi mc li^in/^ 
clone CAT-1 (HTXET53). 




1 00 


1 C\ A A 


A A VAOQ O/C 

AAYOoizo 


nomo sapiens 


xiuman gianuiysin stdaa acuve 

rrct rrr\^ e^Tvi" 

iragmem. 






1904 


gi35065 


Homo sapiens 


Human NKG5 mRNA, expressed in 
naiurai iviiiei ociib ctixu. ± -L/Ciib. 


676 


100 


1905 


gi35065 


Homo sapiens 


Human NKG5 mRNA, expressed in 

natural 1<--| 1 1 p»t" r*^»1 1 C ann 1 'of^l 1 C 

ndlUial K111C1 UCllo allU. JL -v-Cllo. 


354 


100 


1905 


AAR23732 


Homo sapiens 


Gene 519 cDNA derived peptide. 


354 


100 


i r\ f\ c 

1905 


AAW59o /4 


Homo sapiens 


Amino acia sequence or me cltn/\ 
clone CAT-1 (HTXET53). 




1 \J\J 


1906 


AAR23732 


Homo sapiens 


Gene 519 cDNA derived peptide. 


327 


69 


1906 


gi35065 


Homo sapiens 


rluman JNJvCjj rnKJNA, expressea in 
natural Kiiier ceiis ana i -ceiis. 




o / 


1906 


AAW59874 


Homo sapiens 


Amino acid sequence of the cDNA 
clone LA 1-1 {r± 1 All 1 3o j. 


325 


67 


1907 


gil0439180 


Homo sapiens 


cDNA: FLJ22690 fis, clone 

TTCT 1 1 1 1 A 

riMl 1 134. 


1109 


100 


1907 


gi58 17053 


Homo sapiens 


mRNA; cDNA DKFZp586D0824 
(from clone DKFZp586D0824); 
partial cus. 


356 


45 


1907 


gil3569476 


Mus musculus 


immunity-associated nucleotide 4 


354 


44 


1908 


' 1 n /I T /! 1 T7 

gil0434l37 


TT„ ' 

Homo sapiens 


cUNA rl_J Izj / 1 lis, clone 
in i zi^jvi^-uuuyou. 


OQQC 

Z5S3 


QQ 


1908 


AAB94136 


Homo sapiens 


Human protein sequence SEQ ID 

XTfV 1 A A m 
INLJ. 144U1. 


2885 


99 


1908 


gi6807927 


Homo sapiens 


mRNA; cDNA DKPZp434I042 
^rrom clone uivr zjp^d^lv^+z), partial 
cas. 


1730 


99 




rri 1 HA A A^IO 


nomo sapiens 


IlliVlN /\ IOr ITI-jJv/WI lO piOlClll, palllal 
r*H c 


RQ1 

07 1 


100 


1 ono 


cri 1 A9 1 1 790 
gl 1H-Z 11/ zu 


xiorno bapiciib 


LlColllLiallll lIJXvlN .fA.} ^OllllJlC'lC' V/UO. 


104 


34 


1 QOQ 


cril A^SR£04 
gll^lOooOUH- 


WxyZd ballva 


■fro n c r'ri-nti nn 'f5ir*'l"0"r rmicrVi ^HpfitTi 0 
LidllaL'llJJliUll la^LUl iUUgll ollv^a ill 

HVp nrntpin 


79 


32 


1910 


gil0440383 


Homo sapiens 


mRNA for FLJ00027 protein, partial 


1272 


100 


1910 


gil4336762 


Homo sapiens 


16pl3.3 sequence section 7 of 8. 


1255 


99 


1910 


gil0801127 


Mus musculus 


JNK7SAPK-associated protein Id 


1243 


97 


1911 


gi4426613 


Mus musculus 


SLM-1 


1780 


95 


I9l l 


gi3 822555 


Mus musculus 


SJLM-Z 


1 1 

1 1 oO 


o / 


I9ll 


gi34 17603 


Mus musculus 


ETOILE 


1178 


67 


inn 


AA YzoOoz 


Homo sapiens 


rluman ppjyz j secretea piotem. 


94S7 




1912 


AAG03379 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7460. 


377 


100 


1913 


gi 104349 19 


Homo sapiens 


cDNA FLJ 13074 fis, clone 
NT2RP3001855, moderately similar 
to HOMEOBOX PROTEIN 
PKNOX1. 


1543 


98 


1913 


AAB94455 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15102. 


1543 


98 
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X CI Cell I 

identity 


1913 


gi 14043 520 


Homo sapiens 


clone MCjU: lz /y l 
IMAGE:40991 56, mRNA, complete 
cds. 


1 961 

1ZO 1 


61 

Ol 


1914 


gil043484l 


TT * 

Homo sapiens 


cDNA FU 13022 lis, clone 
NT2RP3000753, weakly similar to 

XTT7T 7T> /^\T7TT A A /fC\TT HPTD T"DT CT TJ 

N xi U KOr 1.LAJV1 Jb IN 1 1 Kir J^xi 1 xl 
rKU 1 xilJN . 


4 / 4U 


1 nn 

1UU 


1914 


AAB94420 


Homo sapiens 


Human protein sequence SEQ ID 

XT/^V 1 ^A99 

JNU: l jUZZ. 


4740 


100 


1914 


'zlOO/1 1 OA 

gi48o41o0 


. 

Homo sapiens 


rrixviN/v, cl/in/y uisjr z.p.j o^i lyio 
(from clone DKFZp564P1916); 
partial cub. 


1190 


1 on 


1915 


gi8925970 


Homo sapiens 


eIF4E-transporter mRNA, complete 


5059 


99 


1915 


gi9857633 


Mus musculus 


Clast4 protein 


4581 


89 


1915 


gil0437728 


Homo sapiens 


cDNA: FLJ21601 fis, clone 

V^UiAl /ZZ 1 . 


2997 


100 


1916 


giyo!) /odd 


mus museums 


t^.rasi4 protein 




OJ 


1916 


gioyzsy /u 


xiomo sapiens 


eir 4xi- nansponer niivi n j\, compicie 
cds. 




00 


iy lo 


gllU43 / /Zo 


xiomo sapiens 


^TYWA* TTT T91601 fic nlrmf* 
Ci/iNA. X'iwJZlOUl lib, L-IOIIC 

COL07221. 


90^^ 

JLKJDD 


OJ 


lyl / 


gll4 / 14V3 / 


xiomo sapiens 


cione ivioa^. i / oo / 
IMAGE:3865868, rnRNA, complete 

Ctlo. 


90S9 


i on 


1917 


gi2443272 


Mus musculus 


motor domain of KIF 1 2 


611 


88 


1 A1 n 

1917 


AAWZ70D3 


xiomo sapiens 


oecreteo. piotem Aojz. 






1918 


gi3071 66 


Homo sapiens 


Human mineralocorticoid receptor 
mRNA (hMR), complete cds. 


523 


79 


1918 


AAY21622 


Homo sapiens 


Ligand binding domain of nuclear 
receptor nMK. 


523 


79 


1918 


AAP80927 


Homo sapiens 


Sequence of the human 
mineralocorticoid receptor (hMR). 


523 


79 


1919 


gil395179 


Rattus 
norvegicus 


proteasomal ATPase (MSS1) 


922 


100 


1919 


gil2803525 


Homo sapiens 


proteasome (prosome, macropain) 
26S subunit, ATPase, 2, clone 
MGQ3004 IMAGE:3161790, 
mRNA, complete cds. 


922 


100 


1919 


gi2 19931 


Homo sapiens 


Human mRNA for MSS1, complete 
cds. 


yzz 


1 

1UU 


1921 


gi300482 


human, 
endogenous 
retroviral 
element RTVL- 
Hpl, Genomic, 

sapiens 


pol=reverse transcriptase homolog 
{retroviral element} 


501 


80 


1921 


gil99851 


Mus musculus 


pol protein 


355 


45 


1921 


gi927201 


Murine leukemia 
virus 


reverse transcriptase 


355 


45 


1923 


gil0862828 


Homo sapiens 


mRNA for IFRG15 protein. 


723 


99 


1923 


gil0862856 


Mus musculus 


15kD interferon alpha responsive 
protein 


716 


97 


1923 


gi7658271 


Ginglymostoma 


MHC class II protein beta chain 


41 


50 
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Wit in 






s 

score 


Percent 
identity 






cirratum 








f OlA 


gl4V jZ / J 


Kattus 
norvegicus 


rioosomai proiein oijd 


~> yj w' 


84 


1 QO /t 


/\/\r>D Joy j 


nomo sapiens 


n urn ciii coion caiicei dxiiigoii piuLcm 
sequence SEQ ID NO: 1235. 




84 


1 G9/1 

iyz4 




irioino sapiens 


xlUIIlcill UallCCI aoSUClO-LCU. |JHJLCIX1 

<?pnnpnre SFO ID NO- 1097 




84 


1925 


gil5029945 


Homo sapiens 


clone MGC: 17507 
TMArrF-^45^S76 mRNA comnlete 

cds. 


1700 


99 


lyz j 


^70901 40 

gl /UZU14U 


riUIIllJ odpiCUo 


HTNTA FT T90900 fiQ nlnnp 

GOLF 1206. 


545 


41 


1 oo ^ 
iyzo 


rri 1 96^4477 
gliZOZ)44 / / 


nomo Sapiens 


«i nnp TA/f A f^F -989^7^1 TnR"NrA 

ClOnc IIVJLtVvJxj.^O^ J / _) 1, ilxIVlN.rY, 

partial cds. 


47? 

"-VIA, 


41 * 


1 yzo 


gl3 jtoj 


IVAUo II1U&CU1U.O 


*7fr» 1 nrnfpin ( A A 1 -494 1 


1 7^S 


85 


1926 


gil5081398 


Homo sapiens 


kruppel-like zinc finger protein 
(ZNF300) mRNA, complete cds. 


965 


48 


lyzo 


1 AO A 1 A < 

gllUZU14j 


nomo sapiens 


riuman iJiN/\ oinamg protein 
(HPF2) mRNA, complete cds. 


Q9S 




iyz / 


gl4Z:0_> 1 


(Jryctoiagus 
cuniculus 


xvj Kerann 


1 906 


S4 


1 nn 

iyz / 


gl4J3iOo 


L^ryctoiagus 
cu.nicu.ius 


Kerann ivj 


1 906 


S4 


1927 


gil4250682 


Homo sapiens 


keratin 6 A, clone MGC: 10443 
IMAGE:3947610 } mRNA, complete 


1267 


53 


1928 


gil0437560 


Homo sapiens 


cDNA: FLJ21457 fis, clone 
POT 0470^ 


2563 


99 


1928 


AAB21000 


Homo sapiens 


Human nucleic acid-binding protein, 

TsTn ARP-4 

IN Ltrvxjx -*+ . 


1409 


100 


1928 


gi7239109 


Homo sapiens 


HSPC059 


1305 


49 


1 Q9Q 


gllU4oo /oy 


riomo sapiens 


^TiTSTA FT T1474^ fic o1r»np> 

THYRO 1001 189, weakly similar to 

Trwr" FTTsTOFR PROTFTN Q1 




yy 


1929 


AAB95862 


Homo sapiens 


Human protein sequence SEQ ID 

in w. l oyz.y. 


3435 


99 


1929 


gi 104403 98 


Homo sapiens 


mRNA for FLJ00032 protein, partial 
cas. 


2509 


80 


1930 


AAB43912 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO:1357. 


1604 


52 


1 n*a a 


* 1 AO A 1 A 

gllUZU14j 


Homo sapiens 


riuman uma Dinamg protein 
(HPF2) mRNA, complete cds. 


1 497 


49 




• -1 syco A 1 ^ A 

gl 1ZD841 jy 


riomo sapiens 


zinc linger protein zoo ^ZviNrzooj 
mRNA, complete cds. 


1 ^61 


41 
*+ 1 


1931 


gil3559680 


Caenorhabditis 
elegans 


contains similarity to beta- 
lactamases 


144 


25 


1 en 1 

1 J 1 


A ARSR799 


xxuiiiu bapiclib 


XJltcioL dXl\X U VaXICUl Luliu^l uoowviu t\^v_i 

antigen protein sequence SEQ ID 
430. 


80 


25 


1931 


gi623764 


Entamoeba 
histolytica 


elongation factor la 


72 


28 


1932 


gi7022392 


Homo sapiens 


cDNA FLJ10390 fis, clone 
NT2RM4000104, moderately 
similar to ZINC FINGER PROTEIN 
135. 


440 


43 
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luciiiiLy 


1932 


AAB92770 


Homo sapiens 


Human protein sequence SEQ ID 

JNU: 1 lz43. 


440 


43 


1932 


gil 5 100039 


Mus musculus 


zinc finger protein ZFEND 


438 


43 


1933 


gi9963806 


Homo sapiens 


zinc linger protein ZNrzo / 
(ZrJNrZo/) iTiKJNA, complete COS. 


A 1 A 0 

414Z 


1UU 


1933 


gil 1527849 


Mus musculus 


zinc finger protein SKAT2 


3432 


82 


1933 


gi2293535 


T T ' _ _ 

Homo sapiens 


zinc linger protein {Zjnrzv) rnKJN A, 
complete cds. 


1300 




1934 


gil0439752 


T T • 

Homo sapiens 


cDNA: tLjzM d / lis, clone 
LNG09620. 


1 OZD 


OQ 


1934 


gil3603873 


Homo sapiens 


1 J3F -associate a lactor ll (lAr/^j 
mRNA, complete cds. 


1 1 77 
11// 


yo 


1934 


gil3603831 


Mus musculus 


1 r>F-associatea lactor 11 


£7n 
O /U 




1935 


gi 1060973 


Oryctolagus 
cuniculus 


Potassium cnannel 


1 

1UJ 




1935 


gi5932024 


Oryctolagus 
cuniculus 


voltage-gated potassium channel 

JVVl .3 


105 


30 


1935 


gi3044206 


Oryctolagus 
cuniculus 


potassium channel Kvl.5 


105 


30 


1936 


gi4 150892 


Rattus 
norvegicus 


dermo- 1 protein 


810 


100 


1936 


gil098934 


Mus musculus 


Dermo- 1 


o 1U 


i nn 


1936 


gi42003 14 


Gallus gallus 


Dermo protein 


782 


96 


1937 


gil3752754 


Homo sapiens 


zinc finger 1111 mRNA, complete 
cds. 


2304 


58 


1937 


gil0440398 


Homo sapiens 


mRNA for FLJ00032 protein, partial 
cds. 


2294 


59 


1937 


gil0436789 


T T * 

Homo sapiens 


cDNA FLJ14345 lis, clone 
THYRO1001 1 89, weakly similar to 
ZINC FINGER PROTEIN 9 1 . 


20ol 


D9 


1938 


gil4787181 


Homo sapiens 


CUB and sushi multiple domains 
protein 1 short form mRNA, 
complete cas, alternatively splicea. 


Ton 




1938 


gil4794726 


Homo sapiens 


LUd ana susni multiple aomams i 
protein mRNA, complete cds. 


7SQ 


JO 


1938 


gil4787176 


Mus musculus 


CSMD1 


784 


54 


1939 


gil4787181 


Homo sapiens 


CUB and sushi multiple domains 
protein 1 short form mRNA, 
complete cds, alternatively spliced. 


1 £T 


JO 


1939 


gil4794726 


Homo sapiens 


CUB and sushi multiple domains 1 
protein mRNA, complete cds. 


256 


56 


1939 


gil4787176 


Mus musculus 


CSMD1 


255 


56 


1940 


gil2053079 


Homo sapiens 


mRNA; cDNA DKFZp434F03 1 8 

/ r" 1 T\T/'T7 r 7 /I O /l"T?/\0 1 O \ . 

(from clone DKFZp434F0318); 
complete cds. 


1212 


98 


1940 


gil3384259 


Homo sapiens 


apohpoprotem L6 mRNA, complete 
cas. 


134 


31 


1940 


•AAY35897 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 146. 


130 


32 


1941 


gil 2804053 


Homo sapiens 


clone IMAGE:3940029, mRNA, 
partial cds. 


5182 


99 


1941 


AAU00416 


Homo sapiens 


Human cell regulatory protein pi 93. 


2623 


66 


1941 


AAB43850 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1295. 


954 


61 


1942 


gi!2654005 


Homo sapiens 


Similar to ribosomal protein S9, 


840 


100 
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clone MGQ5482 IMAGE:3452221, 
mRNA, complete cds. 






1942 


gil3938523 


Homo sapiens 


clone MGC:2458 IMAGE:2964451, 
mRNA, complete cds. 


840 


100 


1942 


gil3938567 


Homo sapiens 


clone MGG4138 IMAGE:2964451, 
mRNA, complete cds. 


840 


100 
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TD 


Hit in 

XXI l IIS 


o pctics 


TlPQprinrifin 


s 

score 


Percent 
identity 


972 


ffil817578 

til 1 U 1 1 *s 1 \J 


Hnmn sanien 4 ? 


alnha-fflobin 2 


600 


85 


972 


eil817577 


Homo sanien*? 


alnha-fflobin 1 


600 


85 


972 


£i28558 


Homo s aniens 


readine frame alr>ha-&lobin 


600 


85 


973 


AAW62041 


Homo s aniens 


KYOW Human arsinase. 


1559 


94 


973 


AAR05306 


Homo sapiens 


TOYJ Human arginase. 


1559 


94 


97^ 


cnl 1 97498 

til 1 X.Z7 1 1 KJ 


Homo <jfloiPTi<i 




1559 


94 


974 


AAW62041 


Homo sanien*? 


TCYO^V^ Human arginase 


1552 


94 


974 


AAR05306 


Homo sapiens 


TOYJ Human arginase. 


1552 


94 


074 
y / h 


ail 1 Q74QR 


T-ToTYlfl C£>TM f*TI c 
X1U1X1U OClLJlC'llo 


C*T*0"iT15l CP 


1552 


94 

r 


915 


AAW62041 


Homo sapiens 


KYOW Human arginase. 


1538 


94 


975 


AAR05306 


Homo sapiens 


TOYJ Human arginase. 


1538 


94 


9 / D 


gii iy /4yo 


riomo sapiens 


• 

arginase 


lJJO 


04 


9/6 


A A Dfkl 1 *7<T 

AAByil /j 


riomo sapiens 


xixii^i- Human protein sequence oxiv^ 
ID NO: 121 14. 


zl01 9 
'H-y 1Z 


1 nn 


0*7/C 

9 /o 


gll4zl 1DUU 


riomo sapiens 


secretory protein oxiLxO 


1Z 


1 no 


9 / 0 


gllUiy44 J. 


ivatrus 
norvegicus 


rseco 




OS 


Q77 

y 1 1 


A A Til Q'ins 


xiomo sapiens 


X1LJ1V1/-V- jLlUlIld.il SCCXClCU. piUlClil 

sequence encoded by gene 48 SEQ ID 

MO* 1 RR 


0S4 


76 


077 

y i 1 


cri 1 4001 47S 


XXUJ.IJJJ oCtpiCIlo 


1i?mt"\/ r*p11 1 pi ll^'F'Tnifi irmtpin 1 
iictiiy L/Cii ic uivciiua ^jlkjiclil i 


954 


76 


977 


gil 1999092 


Homo sapiens 


TGF beta inducible nuclear protein 

TTKTP1 

1 IINI^ 1 


954 


16 


978 


gi20 198487 


Homo sapiens 


182kDa tankyrasel -binding protein 


6194 


99 


y /o 


gll OO /03 /H 


riomo sapiens 


irx-fjuu l o'H- proicin 


7747 

Z / *T / 


QQ 

yy 


Q7Q 


m* 1 78^/17 1 7 

gli / OO't III 


I\ /I i t c tni l o j^i lino 

ivius muscuius 


iiomenn 


1 RR 

1 OO 


91 
i 


979 


AAY13459 


Homo sapiens 


UYRQ Amino acid sequence of human 

X^COJ-lXK.C piOLCiXl. 


3643 


96 


979 


gil657752 


Homo sapiens 


FE65-like protein 


3643 


96 


y /y 


m" 1 n 777*3 9 

gll / / / jZ 


IvaiTUS 

norvegicus 




1 

lJJO 


to 


you 


A A V1 'XA^Q 


riomo sapiens 


u i xvv«£ /\iximo dciu. sequence 01 iiuiiiaii 

T7p^S 1 1 tM*ntpiti 
X^ CU J _ lXlvC JJIwLCXll. 




04 


980 


gil 657752 


Homo sapiens 


FE65-like protein 


3492 


94 


oon 

you 


glli3 / / 1 51 


jKattus 
norvegicus 


rxioj 


1 ^94 


*+o 


no 1 

yoi 


A ATQOin^ 

AA1 yZJU2)_ 

aal 


xiomo sapiens 


oALJs. LyOnstitutiveiy active receptor- 
alpha encoding cDNA. 


A'Xl 


OS 


no i 

yoi 


A A 1 nc\ 


homo sapiens 


I UL/A- Amino acio. sequence ot numan 
CAR- a polypeptide. 


A^?7 
*f J / 


OR 


no 1 

yoi 


a a n^mono 

aa wyjyuz 


riomo sapiens 


Lrxixiu Human lak receptor protem. 


All 


QQ 


982 


gil9911227 


Homo sapiens 


2-amino-3-carboxylmuconate-6- 
semiaiaenyae aecaiooxyiase 


1438 


95 


982 


gil8652911 


Rattus 
norvegicus 


2-amino-3~carboxymuconate-6- 
semialdehyde decarboxylase 


Mil 


82 


yoz 


rri 1 QQ 1 1 11 1 

guyy i izj i 


L/ aenorna u diti s 
elegans 


z-amino-j-carDOxyimuconate-o- 
semialdehyde decarboxylase 


OOVJ 


49 
*+z 


983 


AAM48092 


Homo sapiens 


SUMU Human IRG27 protein SEQ ID 
NO 1. 


786 


88 


983 


AAM38928 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2073. 


786 


88 


983 


AAB24220 


Homo sapiens 


SUMU Human immortalisation related 
gene 27 protein sequence SEQ ID 
NO:l. 


786 


88 
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ID 


Hit ID 


species 


F\nr/>i*inf J/111 

uescripuuii 


score 


A. CI V/Cll L 

identity 


no a 

984 


glzU4o /UOO 


riomo sapiens 


tniamine trrpnospnatase 


son 


1 00 

JL \J\J 


984 


gi20467068 


Bos taurus 


thiamine triphosphatase 


445 


88 


984 


gi20467070 


Mus musculus 


thiamine triphosphatase 




DO 


985 


AAB93694 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO:l J2oz. 


764 


49 


985 


AAM41100 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO oUo 1. 


764 


49 


985 


AAM39314 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2459. 


764 


49 


f~\. a?" 

986 


AAO02579 


Homo sapiens 


HYSE- Human polypeptide bLQ ID 
NO 16471. 






986 


AAO02491 


TT 

Homo sapiens 


HYSE- Human polypeptide 1L> 
NO loioi. 


111 
DO 1 


jj 


986 


ABB11103 


Homo sapiens 


HYSE- Human tankyrase-1 
nomologue, oLi^) 1JJ in 0.14/3. 


302 


78 


987 


gil4456631 


Homo sapiens 


dJ54B20.4 (novel KRAB box 
contammg CZrlz type zinc linger 
protein) 


3439 


100 


987 


A A TT1 CA/1A 

AAU 15940 


Homo sapiens 


rlUJVlA- riuman novel secreted protein, 
Seq ID 893. 


ZU1 0 


1 00 


987 


gil020145 


Homo sapiens 


JJJN A Diiiaing protem 




zlO 


988 


AAM95265 


Homo sapiens 


rlUMA- Human reproductive system 
related antigen SEQ ID NO: 3923. 


ZOH 


yo 


988 


gi5901659 


Caenorhabditis 
elegans 


XNP-1 


142 


20 


988 


gi3253105 


Caenorhabditis 
elegans 


C. elegans XNP-1 protein 
(corresponding sequence B0041.7) 


142 


20 


989 


AAU00670 


Homo sapiens 


MILL- Human lANuU 229 
poiypeptiae. 


1 AO A 


OU 


989 


gi20330504 


Homo sapiens 


CLCP1 


440 


31 


989 


gi20988615 


Homo sapiens 


oimilar to endotnelial and smootn 
muscle cen-derived neuiopinn-iiKe 
protein 


A A Ci 


1 

j i 


nriA 
99U 


AAJdJOOIJ) 


riomo sapiens 


KUo-c/ riuman prostate cancer antigen 
protein sequence SEQ ID NO: 1191. 


1 1 90 

1 1ZU 


1 00 

1 uv 


990 


giIozo4Z 


riomo sapiens 


rapamycm-Dinduig protein 


1 1 OQ 


7U 


AHA 

990 


glloz044 


riomo sapiens 


ri^juo-DinQing protem z j> 


i i 


1 00 

1 \J\J 


no 1 


A ATTAQC07 


riomo sapiens 


iin^ i - numan moumcanon 

t-»T*r\f^in "HAT A A /IP 9 

protem, i^iNjr\ivir-z. 


RQ4 

07 t t 


11 


991 


gil2746410 


Mus musculus 


coenzyme A diphosphatase 


507 


48 


nn i 

yy i 


rv-I 1 A'7/C/l OCA 

gllU /04o3U 


Arabidopsis 
thaliana 


r 1JS.Z J. 3 


117 
11/ 


^1 

D 1 


yyz 


A A "D O 

AABDOOio 


Homo sapiens 


KUoJi/ riuman prostate cancer antigen 
protein sequence SEQ ID NO: 12 16. 


ZjO 1 


01 

y i 


992 


' c 1 7A/CC 

gijl/Ooj 


riomo sapiens 


cnapeionm-iiKe protem 


9^5 1 
Ldo 1 


01 

y i 


992 


gil4348900 


Homo sapiens 


heat shock protein 


2381 


91 




rfi 19 02/1*79 

guzyo4 /z 


ivius musculus 


zmc linger protein 


oz>o 


y i 


993 


gil4329524 


Homo sapiens 


dJ871E2.1 (novel protein (ortholog of 
mouse zinc finger protein Zan75)) 


545 


96 


993 


AAG75641 


Homo sapiens 


HUMA- Human colon cancer antigen 
protein SEQ ID NO: 6405. 


364 


97 


994 


AAB95791 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 18751. 


3362 


94 


994 


AAB95291 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO:17513. 


3359 


93 
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SEQ 


Hit ID 


Species 


Description 


S 


Percent 

iHpnHtv 
men \,x i y 


994 


AAM93796 


Homo sapiens 


HELI- Human polypeptide, SEQ ID 

"NTfV ^R97 


3351 


93 


995 


gi2736284 


Mus musculus 


Ldbla 


2224 


99 




rM9^£9*\99 
glZOoZ JjLL 


f^-O 1 1 T 1 fl linn 

ijanus ganus 


neural sre interacting protein, long 
form; NSIP long form 


zzuu 


OR 




glO 1ZO / y 1 


riomo sapiens 


rsuciear i^iivi mxeraciui 


917^ 

Z 1 / J 


1 no 




J\J\r>OZZVZ 


riomo sapiens 


KJ.ijr.ts- L^eii cycle proiem jrxaon-isoiorm 
2. 


Ol / 

f 




on/; 


A A T3 A99m 


riomo sapiens 


iviw-t!/- t^eii cycle protein rvaan-isoiorm 
1. 


£1 7 
Ol / 


1UU 


990 


glzUU /Z4o i 


Mus musculus 


oimiiar to kijsj3JN cuin/y i /uuuiyuuo 
gene 


/ 


ou 


997 


AAU75578 


Homo sapiens 


UYNA- Human ubiquitin specific 

M ,.„i Q „ _ _ 1 A /T TOT) 1 rv\ 

piotease lu ^Uor'iuj. 


6817 


100 


997 


gil3560797 


Homo sapiens 


ubiquitin specific protease 


6817 


100 


997 


AAM40478 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 5409. 


4801 


99 


r\ f\ o 

998 


A A 1 — 1 1 1 O O C 

AAE13285 


TT 

Homo sapiens 


INCY- Human transporters and ion 
channels (TRICH)-12. 


511 


62 


r\ r\ o 

998 


gi|2072966| 
gb|AAC512 
72.1| 


T T ' _ 

Homo sapiens 


p40 


366 


4z 


999 


gil7429686 


Ralstonia 
solanacearum 


PUTATIVE D-LACTATE 

DbHYDKOOrbNAbJi 

(CYTOCHROME) 

UAlJUUKJtilJ U CI Aori JrKAJ 1 rsiJN 


1217 


51 


999 


gil7982522 


Brucella 
melitensis 


D-LACTATE DEHYDROGENASE 

(LY 1 UUWKvJlvlli) 


1148 


47 


999 


gil7740510 


Agrobacterium 
tumelaciens sir. 
C58 (U. 
wasningtonj 


FAD dependent oxidoreductase 


1147 


48 


1000 


AAB36613 


Homo sapiens 


INCY- Human FLEXHT-35 protein 
sequence ori^ iu invj.^j. 


528 


71 


1000 


gil4603247 


Homo sapiens 


Similar to RIKEN cDNA 5730409G15 
gene 


528 


71 


1000 


AAB93042 


Homo sapiens 


HELI- Human protein sequence SEQ 

11J JNVJ. 1 ioZ / . 


220 


84 


1001 


AAO 10631 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

INvJ Z40ZJ. 


186 


72 


1001 


gil3421401 


Caulobacter 
crescentus CB15 


MutT/nudix family protein 


133 


40 


1001 


ABB05692 


Homo sapiens 


GEHU- Human amygdala derived 
piotem clone amyz zn /. 


112 


29 


1002 


gi4689229 


Rattus 
norvegicus 


b-tomosyn isoform 


2075 


94 


100? 


Kl— ' / > v J O J/ 


XVcl LLLla 

norvegicus 


ill— lAJinuo yii 


1 976 




1002 


gi4689231 


Rattus 
norvegicus 


s-tomosyn isoform 


1896 


94 


1003 


AAM40991 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 5922. 


365 


83 


1003 


AAM39205 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2350. 


356 


82 


1003 


AAO07159 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 


328 


75 
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ID 


Hit TD 
mi 11/ 


optjcica 




q 


oi* f* £> n + 
l Ml IsCll L 

iH fii ti tv 








NO 21 051 






1004 


AAY21631 


Homn ^ar>iPTic: 

XXVJLLIXJ OClL/lVllO 


RTTtP T icranH 1^^TlH^TlO' , Hnmfiin r\f 

AVUVJV^ J_rl^dllLl UUIU.111^ UU1J.1U1J.1 KJX 

nurlear rprRntnr liTTRHeta 




1 00 

1 \J\J 


1004 


AAP80921 


Homo sanipn*? 


SAT,TC Sennenrp encoded Hv human 
olacenta thvroid receotor c-erbA 
cDNA. 


497 


1 00 


1004 


gi31207 


Homo saniens 


out tlivroid hormone receotor 

i— ' \> » l-X X T X V-^ X Vi- 11V X X X iwllv X %p/ ^— ' W W X 


497 


100 


1005 


gi3 1 92954 


Homo sapiens 


ataxin-7 


1065 


35 


1005 


gi2370155 


Homo s aniens 

X. X X X XY»S 1J fc>X k-/ X W liU 


soinocereliellar ataxia 7 


1064 


35 


1005 


gil9071468 


Mus mus cuius 


ataxin-7 


1020 


35 


1006 


si3 192954 


Homo saoiens 


ataxin-7 


1250 


40 


1006 


gi2370155 


Homo saniens 


sninocerplipllar ataxia 7 


124Q 


40 


1006 


gil9071468 


Mus mus cuius 


ataxin-7 


1186 


40 


1007 


AAU 16630 


Homo saoiens 


FTTJA/rA- T-Tnmflti novel secrfitpH nrntein 

Seq ID 1583, 


1 106 


97 


1007 


AAU16631 

x Li x. V—/ X V/V/^/ X 


Homo saniens 


TTFnVTA- Human nnvpl Qpprptpd "nrofpin 

Seq ID 1584. 


970 


99 


1007 


AAU 16223 


Hnmn QanipnQ 

±X\JXXX\J OClUlt'llO 


T-TT TA/T A — TTnm z\ n nr^t/pl cpr'Tptpd TTrotf^in 
J- X \^ IVX^cV llU.llla.ll 11UVC1 oCUlvlCU UJ.UlCXll ? 

Sea ID 1176 


047 


99 


1008 


ei38482 


Homo saniens 


rvrotp.a sottip Qiiniinit T A/TP7 

h-*X \J LvUijVJlllv OUUUllll 1 J±VXX / 


1 399 


83 

0«y 


1008 


2il054747 


Homo saniens 

XXVJXXXKJ JUUXVllJ 


altPTTiativp fiT<?t pxriti ^1H^ 


1 399 




1008 


gi596140 


Homo sapiens 


proteasome subunit LMP7 


1369 


82 


1009 


AAB93756 

i\ilXV i \J 


Hfitnn «ianif*n«! 


HThT T— TTiiTnnn nT , <rf"f a iTi opnnpnfp ^ITh^^ 
xxx-iX^iX~ X1U.H1<111 jJlULClll bCL[U.011L'C OJDv<J 

ID NO: 13429. 






1009 


eil4165506 


Hornn QanipriQ 

X XVJXiXKJ OCXL/lL/llO 


OT^A mptlTvltTa'ncfpTn cp 1 — jiocr^fin+f^H 
iviNri. iiicLiiy iLidiioicidoC I "doouuiciiCLi 

protein 1 




96 


1009 


&i9309471 

t~x w s yj _y r / x 


XXVJlllvs OUUlt'lliJ 


y~i"N^V/^ ^ T , 1 pccr^f^iJitpH TTm+f^in-l 

-L/1N1VJL X 1 adoUUldlCU. IJXLILClll"" 1 




06 
yo 


1010 


gi6942267 


Danio rerio 


bHLH transcription factor Mesp-b 


78 


27 


1010 

lulu 


cri 1 7^0760 


DUo LaLllUb 


aggrecdn 


i& 
/o 


zo 


1010 


gi3395774 


Aplysia 

palifmTiiPn 

1/CllllVJlllllsCi 


G-protein-coupled 5- 

liytllUAy IxyyjlclxLxLLlC ICUCpiUI 


72 


29 


1011 


AAU17585 


Homo sapiens 


HUMA- Novel signal transduction 


1010 


86 


1011 


AAU17212 


Homo sapiens 


HUMA- Novel signal transduction 

natliwav "nrntpin ^»f=»n JYm 11 H 
uciiii way ljujic/iii, ocu xxj 1 / /. 


1010 


86 


1011 


si5815353 


Hnmn ^anipriQ 


T dntnain pnntaininor •nrrvtpiti 1 icnfnrm 

J V-ivJllltlll-l L/UllLdlllULlg IJlUtClli J. 1£>UXUI111 

a 


1 01 0 




1012 


gil6741189 


Homo s aniens 


Similar to nuclear nrelamin A 
recosnition factor 

X VVUC.H1 HV/11 lUv V \JX 


2143 


100 

X \J\J 


1012 


AAB92498 


Homo s aniens 


HHT^T- Human nrntpin spmienre SPO 

X IxvxjI J. x-LXXXXtlXJ, Ul U Ivlli Ovvj Livllvv UxvW 

ID NO- 10602 


2084 


89 


1012 


AAB97260 


Homo sapiens 


SHAN- Human NADP hydrogenase 
subunit 50 

KJ VX L*l XX V \J m 


2084 


89 


1013 


gi5917666 


Zea mays 


extensin-like protein 


451 


27 


1013 


gi 15 145793 


Sus scrofa 


basic proline-rich protein 


422 


27 


1013 


gi600118 


Zea mays 


extensin-like protein 


394 


26 

Xw W 


1014 


gi407468 


Mus musculus 


SEB4 


1166 


93 


1014 


gi407419 


Homo sapiens 


SEB4D 


1141 


96 


1014 


gil9851930 


Homo sapiens 


CLL-associated antigen KW-5 


1134 


100 


1015 


AAG81328 


Homo sapiens 


ZYMO Human AFP protein sequence 
SEQ ID NO: 174. 


1562 


100 


1015 


AAB75532 


Homo sapiens 


ROSE/ Human secreted protein 
sequence encoded by gene 27 SEQ ID 
NO:86. 


680 


92 
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Percent 
identity 


1015 


gi9802536 


Arabidopsis 

t n 5m i n n £i 

LXldXlCtild 


F17L21.25 


305 


24 


1016 


AAB43450 


Homo sapiens 


HTJMA- Human cancer associated 
nrotein <?ermence SEO ID NO* 895 


779 


98 


1016 


AAM69101 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID "MO- 90407 


132 


100 


1016 


AAM56723 


Homo sapiens 


MOLE- Human brain expressed single 
exon probe encoded protein SEQ ID 

NO- 9RK98 


132 


100 


1 m 7 




T— T/^\ W1 r\ CQTMPT1C 


T-TTTIV/TA- TTnmnn r^Ticpt* siQcopiafpfl 
nrotpin <;pniipnpp ,SEO TD NO' 895 


723 


76 


1017 


AAM69101 


Homo sapiens 


MOLE- Human bone marrow 

pvnrp<?*5pH "nrobp ptipoHpH tTrotftin SEO 

ID NO: 29407. 


132 


100 


1017 

X \J x / 


AAM56723 


Homo sat)iens 


MOLE- Human brain exnressed sinsle 
exon probe encoded protein SEQ ID 
NO: 28828. 


132 


100 


1018 


gil2653673 


Homo sapiens 


tubulin, gamma 1 


2329 


97 


1018 


gil 83703 


Homo sapiens 


gamma-tubulin 


2320 


96 


1018 


gi4 115724 

t . -A- ■ A ^ -w- 4 ■ I F ■ 


Rattus 
norvegicus 


tubulin 


2304 


96 


1019 


gil2653689 


Homo sapiens 


Similar to aspartyl-tRNA synthetase 


2586 


100 


1019 


ail79102 


Homo sapiens 


asnartvl-tRNA synthetase 


2545 


98 


1019 


gil4250408 


Mus musculus 


Similar to aspartyl-tRNA synthetase 


2521 


96 


1020 


gil 8448004 


Homo s aniens 


nuclear associated nrotein 


2867 


100 


1020 


AAM93729 


Homo sapiens 


HELI- Human polypeptide, SEQ ID 
NO: 3689. 


2859 


99 


1020 


gi 184609 18 


Mus musculus 


nuclear protein NAP 


2756 


95 


1021 


AAU91297 

X XX X * ' S X M _V / 


Homo saniens 

X±V1HV/ UUL/iVUJ 


CORT- Human nrotein NOV4 


1744 


100 


1021 


AAB73671 


Homo sapiens 


INCY- Human oxidoreductase protein 
ORP-4. 


1744 


100 


1 071 


A ATT7Q001 


XXVJllltJ odlJltXlo 


POT^T- Human NOV4 variant nrotein 

V^V/lv 1 XXLAlJ.lu.il J.N v/ V " V Cll IdllL UlUH/ill. 


1743 


99 


1022 


ABB04369 


Homo sapiens 


MILL- Human 32225 polypeptide. 


1834 


100 


1 099 




no mo adpienb 


XXXJ/IjI- JTLUIIldll piOlCilX bCL[UCIlL'C OBy 

ID NO: 14807. 


1 81 Q 
i o x y 


QQ 




A A 0671 47 


jnuniu baUiCilo 


rW^~ , *V_ A tti in a nnrl opniipn^p r\'f c\ 
XINV^ I — ^VilillXU ctL>lU oCL^U.CXXL'C Ul d 

human enzyme. 


1 031 

X VJ X 


*j ~> 






JTlvJIllU oapiCUo 


A /TOT "p Wnman l^rnin pvnrpQ<;pH «:incr1p 

IVlvyljlj XXIXlilCtlX L/i alJJ tAL'l. too t^Ll olllglv 

exnn nrobp pncoded nrotein SEO ID 
NO: 30248. 


612 


100 


1023 


eil7861968 


T^ro^onhila 

melanogaster 


LD03052n 


197 


50 


1023 


si4996365 


A rahi H on<ii 
thaliana 


nolvnrotpin 


78 


31 




AAB93824 


Hnmn QpnipnQ 

XXVJlliVJ OdLJlt^llO 


"HET T- T-Tiimnn nrotpin <5pnnpnpp SEO 

11JUJL/1 XX.LAXX1CIXX UlUlvill O^UU^IUjU UJ-/V^ 

ID NO: 13634. 


719 


100 


1024 


gil 8605620 


Mus musculus 


similar to proline-rich protein 48 


86 


38 


1024 


gi5902891 


Streptomyces 
avermitilis 


type I polyketide S3mtliase AVES 1 


81 


32 


1025 


ABB50184 


Homo sapiens 


INCY- Human transcription factor 
TRFX-35. 


1452 


57 


1025 


AAB93164 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO:12091. 


1372 . 


58 


1025 


AAM40308 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 


1358 


49 
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1026 


gi 11386005 


Homo sapiens 


hepatocellular carcinoma-associated 
protein hcaiu 


2802 


100 


1026 


gil7981589 


Homo sapiens 


EH domain-containing protein-4 


2802 


100 


1 AOiC 

lUzo 


n i 1 "3 A O 1 OQA 

giiiUz lyoU 


Homo sapiens 


hepatocellular carcinoma-associated 
protein HCA1 1 


ZoUZ 


1 AO 
1UU 


1027 


AAB64400 


Homo sapiens 


INCY- Amino acid sequence of human 
intracellular signallrng molecule 
INTRA32. 


1238 


95 


1 An 
102 / 




Homo sapiens 


HJiLl- Human protein sequence oJa^ 
ID NO: 16446. 


lZ3o 


yj 


1 AT7 
lUZ / 


A A "Dm OQQ 


ilomo sapiens 


Hxii^i- Human pro tern sequence o±ii^ 
ID NO: 14087. 


1 OIQ 
1Z35 


yj 


1 AO O 

lUzo 


A A "D fZA A A A 


Homo sapiens 


1JNCY- Amino acid sequence 01 Human 
intracellular signalling molecule 

T\TTD All 

liN 1KA3Z. 


1 1 OA 

1 1ZU 


Q Q 
OO 


1028 


AAB94943 


Homo sapiens 


HELI- Human protein sequence SEQ 
1JJ N(J: lo44o. 


1120 


88 


1 AO o 

lUzs 


AAByiyoy 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 14087. 


1 1 OA 

1 1ZU 


OO 
OO 


1 mn 


giizoy /4 / / 


Jrlomo sapiens 


clJo j Uiiy.Z (novel protein similar to 
Drosophila CG6762) 


£QA 


1 AA 
1UU 


iuzy 


rr-I 1 <Q'7'7/I OA 

giioo / /4y4 


Homo sapiens 


oimiiar to IvlivlilN CJJINA 1 /UUIZ /r>U4 

/t a to ^ 

gene 


0 / iS 


1 AA 
1UU 


1029 


gil 5030 144 


Mus musculus 


Similar to RIKEN cDNA 1700127B04 
gene 


613 


88 


1030 


gi20987276 


Mus musculus 


RIKEN cDNA 5730409F23 gene 


1778 


99 


1 ai a 


rri 1 7C 1 AO'7 

glj.35 1UZ / 


Homo sapiens 


pnospnori d o sypyropno spnate 
synthetase-associated protein 39 


1 / / J 


QQ 


1 at a 


gi4jo / / y 


Kattus 
norvegicus 


pnospnori Dosyipyropnospnate 
synthetase-associated protein (39 kDa) 


1 /DO 


no 


1031 


AAB35408 


Homo sapiens 


HOSP- Human 07CG27 gene protein. 


619 


28 


i c\i i 
1U31 


AAByiyyi 


Homo sapiens 


Hr/Ll- Human protein sequence oUC^ 
ID NO: 14095. 


Alt 

411 


O 0 


1 ao. 1 


A A OA/1 1 czr 
AAUU4o JO 


ilomo sapiens 


H Y o.b- Human poiypeptiae bii^ iu 

XTO 1 QOAQ 
1NL/ loZ4o, 


inn 
off 


QQ 


1032 


AAB95299 


Homo sapiens 


HELI- Human protein sequence SEQ 

1JJ JNW. 1 /D3U. 


2238 


99 


1032 


AAB67054 


Homo sapiens 


INCY- Human immune response 
molecule (IJviujnj pro tern oiiv^ iu jnu: 

Q 
O. 


1414 


98 


1 AQO 


A A ft/TQ^/l 
AAiVtyj40D 


Homo sapiens 


HUiviA- Human reproauctive system 
reiaiea anugeii oxi^ il» inlj. ^fizo. 


Id j 


y i 


1033 


gil5809588 


Homo sapiens 


linRNP Q2 


3140 


99 


1 m i 


a a c±ha a <: 


Homo sapiens 


HUiviA- Human colon cancer antigen 
protein oiiv^ ii^ iN^.Djuy. 


1 1 AA 


OA 

y4 


1033 


AAY70242 


llUHlu OUUlVllO 


TMCY- 1-Tiiman RNA-a<5Qnr , iatP k H 

XJ. > X J.1U111U11 XVLIiT. (tOOUwlHluU 

protein-23 (RNAAP-23). 


3100 


94 


1034 


gi285933 


Homo sapiens 


Mel- 18 protein 


1811 


97 


1034 


gil8999362 


Homo sapiens 


zinc finger protein 144 (Mel- 18) 


1811 


97 


1034 


gil9548760 


Mus musculus 


mel-18 protein 


1738 


93 


1035 


AAZ21227_ 
aal 


Homo sapiens 


UYUP- Human CG1CE short form 
cDNA sequence. 


859 


53 


1035 


gi3598876 


Homo sapiens 


vitelliform macular dystrophy protein 


859 


53 


1035 


gi3335159 


Homo sapiens 


bestrophin 


859 


53 
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1036 


gi|19526874 
[reflNP 598 
464.11 


Mus musculus 


RIKEN cDNA 1500026F15 


208 


66 


1036 


gi|17550864 
|ref|NP 509 
196.1| 


Caenorhabditis 
elegans 


C36B7.6.p 


70 


33 


1037 


AAM41409 


TP 1* * 

Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 6340. 


r\r\ a 

994 


on 

99 


1037 


A A nr A1 

AAY48501 


T T * 

Homo sapiens 


MET A- Human breast tumour- 
associated protein 46. 


HOT 

98 / 


AO 


1037 


A A T~» f ' — 7 A A J? 

AAB67445 


T T * 

Homo sapiens 


INCY- Amino acid sequence of a 
human chaperone polypeptide. 


O C A 

oo9 


1 AA 
100 


1038 


AAM83952 


Homo sapiens 


T TT T\ /T A T T 

HUMA- Human 

immune/haematopoietic antigen SEQ 

TT* ATO- 1 1 <A ^ 

1JJ iNL>. 1 1D4D. 


C70 

572 


A'? 


1038 


AAG01992 


Homo sapiens 


GEST Human secreted protein, SEQ 


296 


100 


1 m o 


aaiod /yu_ 

aai 


riomo sapiens 


Or5JrrJ iNucieotio.e sequence oi a numan 
jryD/oJ_//\Jr i protein. 


/o 


ZO 


1 n^o 


AAUZUj Id 


riomo sapiens 


jnLUivi/\.- Jriuiiidn secreieo proiem, ocq 
TF> TsJn SOS 




00 


1039 


gi20379968 


Mus musculus 


similar to FLJ00101 protein 


140 


33 


1 ai n 


„: 1 r\A A r\c i a 

gllU44Uj 1U 


rlomo sapiens 


T7T TA A 1 A 1 «Vi-itiain 

rj^juuiui protein 


1 ^0 

1 Dzf 




1040 


AAU20513 


Homo sapiens 


HUMA- Human secreted protein, Seq 

ID iNO 


959 


77 


1 A/1 A 

1U4U 




JVLus museums 


similar to pi^juuiui protem 




■^^ 
jj 


1040 


gil0440510 


Homo sapiens 


FLJ00101 protein 


139 


36 


1041 


gil2o_)zo4y 


Homo sapiens 


ribosomal protein L28 




/O 


1041 


gi488835 


Mus musculus 


ribosomal protein L28 


631 


75 


1041 


gi571 13 


Rattus 
norvegicus 


ribosomal protein L28 (AA 1-13 /) 


/C1 1 

oil 




1042 


ABB 11796 


Homo sapiens 


HYSE- Human secreted protein 
nomologue, oxiv^ iu jn kj.Z loo. 


4262 


99 


1042 


AAE07171 


Homo sapiens 


PHAR- Human HuIFRG-3 protein. 


4249 


99 


1042 


A a tv /TO CO /I 1 

AAM25341 


Homo sapiens 


HYSE- Human protein sequence SEQ 

1U JNCJloDO. 


zzo/ 


AA 

yy 


1043 


AAB43873 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence bxi^ iu iNvj.iJio. 


1004 


91 


1043 


gi306809 


Homo sapiens 


glutathione S -transferase 


1004 


91 


1043 


gliUoo ID 


Homo sapiens 


glutatnione o-transierase (Lro 1 , xiC 


1 AA/I 

1UU4 


01 
y 1 


1044 


AAJ343873 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence o-bQ iu jnvj. lo lo. 


990 


90 


1044 


gi306809 


Homo sapiens 


glutathione S -transferase 


990 


90 


1044 


gi306815 


Homo sapiens 


glutathione S -transferase (GST, EC 
2. 5.1. 18) 


990 


90 


1 A/1 ^ 

1U4D 


A/VX54JO / J 


Homo sapiens 


rlUJVlA- rluman cancer associatea 
protein sequence SEQ ID NO: 13 18. 


lUUO 


Q1 

y 1 


1045 


gi306809 


Homo sapiens 


glutatliione S -transferase 


1008 


91 


1045 


gi306815 


Homo sapiens 


glutathione S -transferase (GST, EC 
2.5.1.18) 


1008 


91 


1046 


AAM90773 


Homo sapiens 


HUMA- Human 

immune/haematopoietic antigen SEQ 
ID NO: 18366. 


905 


97 


1046 


AAB93267 


Homo sapiens 


HELI- Human protein sequence SEQ 


753 


43 
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ID NO: 12300. 






lU^f u 


|reflXP_l25 


ivius mus cuius 


simuai to expressed sequence l-ojh-j / 


*T J O 1 


O J 


1047 


gi520450 


Homo sapiens 


sorbitol dehydrogenase 


1733 


94 


1047 


gi496086 


Homo sapiens 


L-iditol-2 dehydrogenase 


1726 


94 


1 A/17 

1047 


AABjo /4g 


Homo sapiens 


ROSE/ Human prostate cancer antigen 
protein sequence SEQ ID NO: 1326. 


l0Z2 


Q/t 


1048 


AAB58331 


Homo sapiens 


ROSE/ Lung cancer associated 
polypeptide sequence SEQ ID 669. 


4 /o 


an 
y / 


1 n A o 

1048 


gi6066468 


T „ J „ 1 * „ 

Leishmama 
major 


probable DNA polymerase zeta 
catalytic component 


5Z 


51 


1048 


AAM88708 


Homo sapiens 


HUMA- Human 

immune/haematopoietic antigen SEQ 
ID NO: 16301. 


77 


43 


1049 


AAM49036 


Homo sapiens 


BODE- Human phosphatidylinositol 3 
(r tains 3) -kinase 36. 


1633 


100 


1 A /t A 

1049 


A AT>nC707 

AAB95783 


Homo sapiens 


HELI- Human protein sequence SEQ 
1JJ \S\J: 1 6 15 D. 




no 


1049 


AAM9402 1 


Homo sapiens 


HELI- Human stomach cancer 
expressed polypeptide oii^ id invj 
112. 


1626 


99 




AA±>001J 1 


riomo sapiens 


ri u ivi/v- riuman secrete a protem 
sequence encoded by gene 55 SEQ ID 


QzlS 


yJ 


1 nsn 


glOO*+ LDDH 


nuiiiu sdpiciib 


rior i oo 


QACi 




1050 


gi20987916 


Mus musculus 


RIKEN cDNA 1810029F08 gene 


849 


86 


1 n< 1 




riomo sapiens 


xiiiiji- riuman pro tern sequence oxiv^ 

LLJ INV^. 1 J f\JD. 


1 Q 1 Q 

loiy 


QQ 

yy 


1051 


g il 8447408 


Drosophila 
meianogaster 


RE13191p 


144 


29 


1051 


gil5809236 


Plasmodium 
iaiciparum 


merozoite surface protein 2 


117 


38 


1052 


gil5559064 


Mus musculus 


SNAG1 


373 


56 




glOl04oZo 

• 


Homo sapiens 


bhL3 and rX. aomam-contaimng protem 
SH3PX1 




4z 


1 AC') 


i ao /f o 
glj41UZ4y 


Homo sapiens 


oJJrl piotem 






1053 


gi21410770 


Homo sapiens 


Similar to RIKEN cDNA 1500005K14 
gene 


1137 


100 


1053 


gil8958091 


Danio rerio 


SI:bZ71M17.1 (novel protein similar to 
mouse proteins) 


384 


40 


1053 


AAM93224 


Homo sapiens 


HELI- Human polypeptide, SEQ ID 
NO: 2637. 


149 


43 


1054 


AAM79121 


T T * 

Homo sapiens 


HYSE- Human protein SEQ ID NO 
1783. 


21 ll 


95 


1054 


gi28384 


TT • 

Homo sapiens 


5 half of the product is homologues to 

-Dd-OlllUS SUDtUS O-TTwl^/Vrv Syilllieiabc, D 

half corresponds to the catalytic 
subunit of AIR carboxylase 


2111 


A C 

95 


1054 


gil 7939425 


Homo sapiens 


multifunctional polypeptide similar to 
SAICAR synthetase and AIR 
carboxylase 


2111 


95 


1055 


AAG01499 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 5580. 


293 


100 


1055 


gi 20875035 


Mus musculus 


similar to RNA polymerase II 


74 


25 
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xxit JLU 
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c 

score 


identity 




|reijAr ui 
118.1| 




termination iactor 






1 AC/: 
lloo 


gll /y4D / oJ 


JJiosopnila 
meianogaster 


Kri34Uoyp 




DO 


1056 


gi2072290 


Xenopus laevis 


XL-INCENP 


113 


25 


1 A<CA 
1UDO 


rril 1 A1 Q Q/l 1 

glZ 1Ujoo4 I 


oaenornauoitis 
eiegans 


Y jyrSoA. 1 o 


1 1 ^ 
1 ij 


0^ 

ZD 


1057 


AA013523 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

1 N \J Z/T'lJ. 


1302 


83 


1057 


AAB95348 


Homo sapiens 


HELI- Human protein sequence SEQ 


111 


29 


1057 


gil4550467 


Homo sapiens 


spermatogenesis associated 2 


111 


29 


1 A^Q 


AAlVlDZO / Z 


riomo sapiens 


oriAiN- numan microgioounn 
transcriptional control iactor ju. 


1 1 79 
1 1 /z 


527 
0 / 


1058 


gil6588712 


Homo sapiens 


P33 


1172 


87 


1 AC O 


^■51 /ncni /en 
gl!4zj)Uloy 


riomo sapiens 


bimiiar to leucine zipper protein 
r Kou i 


I 1 70 

I I /Z 


C7 
O / 


1059 


AAO 16945 


Homo sapiens 


KYOW Human NF-kappaB activity 
ennancmg protem oxiv^ ixj invj. i. 


2622 


100 


1 asq 
lioy 




riomo sapiens 


xv. i w w numan in jt -Kappao activity 

pnliQnf'i'nfT nrntpin O In '(~\ TT^ T\Tf^ • ^ 
CllliailL'i.llg piUlClll OlZiK^/ 1LJ 1NW. u. 


ZOUo 




1059 


AAB62180 


Homo sapiens 


PLAC Human p95 protein. 


2395 


89 


1 a/^a 


gll O04 oZ jU 


urosopniia 
melanogaster 


unZj oZ3p 


^00 


jj 


1 a/^a 


~;niCA1 1/1 
gli J 1501 14 


riomo sapiens 


rao interacting lysosomal protein 


onn 
zuu 


jU 


1060 


AAG02093 


Homo sapiens 


GEST Human secreted protein, SEQ 

ID PiKJ. Ol /4. 


136 


41 


1061 


gil 0440398 


Homo sapiens 


FLJ00032 protein 


2225 


60 


1 A/^ 1 

lUo 1 


glU / jz / j4 


riomo sapiens 


zinc linger 1111 


0 1 fn 
z 1 0 / 


Dy 


1061 


AAM79739 


Homo sapiens 


HYSE- Human protein SEQ ID NO 

JJOJ. 


2164 


54 


1062 


ABB55706 


Homo sapiens 


FECH/ Human polypeptide SEQ ID 
NO 18. 


1476 


99 


1 A^O 


A ATTO QQQ7 

aau Joyy / 


rlomo sapiens 


LrxiJVi y riuman secreted protein 
yall 1. 


1 47^ 

14/0 


QQ 

yy 


1062 


AAY 17227 


Homo sapiens 


GEMY Human secreted protein (clone 
yal-1). 


1476 


99 


1063 


A A "DA/! CCO 


Homo sapiens 


HliLl- Human protein sequence ISJiQ 

1JJ INO: lw)3Zo. 


jyj 




1063 


AAM94024 


Homo sapiens 


HELI- Human stomach cancer 
expressed polypeptide bx±Q ID NU 
118. 


593 


75 


LVOD 


gl Id /y Iz / 


JNeissena 
meningitidis 
Z/Z4y i 


pnospnoenolpyruvate carboxylase 


00 


A A 

4U 


1064 


gil3752754 


Homo sapiens 


zinc finger 1111 


2345 


58 


l U04 




riomo sapiens 


PT TnAA70 r>T-rtt<ain 

ri_,juuujtz protein 


ooon 
ZZ7U 




1064 


AAB95862 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 18929. 


2085 


59 


1065 


AAO 11604 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 25496. 


888 


100 


1065 


AAO04671 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 18563. 


270 


43 


1065 


AAO01369 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 15261. 


202 


39 
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1066 


AAE 16475 


Homo sapiens 


OSTE- Human collagen alpha 1 (I) 
proiem. 


75 


38 


1066 


ABB90764 


Homo sapiens 


UYJO Human Tumour Endothelial 
iviarKer poiypcpuuc oxiv^ -u-' invj z-ui. 


75 


38 


1066 


ABB09625 


Homo sapiens 


OSTE- Amino acid sequence of human 
collagen rype i aipna i . 


75 


38 


1067 


AAG65158 


Homo sapiens 


BIOW- Human negative regulator 21 
oi programmea ceu ucaui. 


710 


99 


1068 


AAB95237 


Homo sapiens 


HELI- Human protein sequence SEQ 

ID INvJ. 1 / Jo4. 


300 


30 


1068 


ABB05725 


Homo sapiens 


vjiiriu- Human nucieic aciu 
management protein clone tes3 31al0. 




30 


1 A /C O 

1068 


A A TD Cil 1 on 

AAr>y3 izy 


Homo sapiens 


Pi r, I / 1 - XlUIllall piuiclii iscmJ-CiivyC oiiy 

ID NO:12017. 


298 


30 


i a/ca 

iooy 


AAr5yDD4u 


nomo sapiens 


nXiJ_,l- nUIHall piUlClll oClj[UCilOC Oi^y 


2482 


99 


1069 


AAM79264 


Homo sapiens 


HYSE- Human protein SEQ ID NO 
1Q96 


1016 


44 


1069 


gi8308176 


Homo sapiens 


cingulin 


1016 


44 


10/0 


AAB4 / 134 


Homo sapiens 


3478571CD1. 


^992 


99 


1 r\'~i r\ 

1070 


A A T TOO A 1 *7 

AAUozUl / 


riomo sapiens 


iiNv^ i - numan sccrciecj proiem 
SECP43. 


21 9S 


51 
~j i 


1 f\'~! f\ 

1070 


giloo /o /lo 


Homo sapiens 


ri^JUUZD/ protem 


2049 


54 


1 A^7 1 
10/1 


gllz3UOD / 


oaccnaromyces 

• • 

cerevisiae 


osaip 


1 62 


33 


1071 


gil72697 


Saccharomyces 
cerevisiae 


SRK1 


162 


33 


1071 


gil /zolz 


oaccnaromyces 

* * 

cerevisiae 


ooJL/i proiem 


1 6? 


33 


1U /Z 


AAKOU 1 Z / 


nomo sapiens 


lVl/\kjl JTIU.lild.il 11C O till ^JlVJlClll lo U5C1U1 

to identify brain tumours. 


1637 


91 


1 A^70 

10 /z 




Homo sapiens 


ncsiin 


1637 


91 


1072 


AAR27205 


Homo sapiens 


MASI Human nestin. 


1622 


90 


1 ATI 
10 / J 


A A VQ 1 /I O /t 

AA Y y 1 4Z4 


Homo sapiens 


llUiVljrA.- XlUIIla.ll bCL/lCLCU. piULClll 
oCUUCllv^C ciil^lju.cli uy gciic; i^. kjJjy iJ--' 

NO:145. 


994 


96 


1 

1U / 3 


gll oO / oouu 


riuiiiu bapiCIlo 


FT T001 OR nrntein 


111 

J- A- JL 


31 


1073 


gil532071 


Zea mays 


glycine-rich protein 


96 


28 


1 f\H A 
1U /4 


A Aft 1 1 "7*7A 

A Ari 1 1 / / U 


riomo sapiens 


protein. 


7638 


98 


1 C\H A 
10/4 


glZUjz / 0 


Kattus 
norvegicus 


male germ CCll-aooOL'la.LCiJ. IvlllaoC ^lllalv j 


91 83 

Z» 1 OJ 


80 


10/4 


gojy 14 


ivius mus cuius 


rcK 


21 50 


79 


1 V /D 


AAvJUj j 34 


xiomo sapiens 


vjriij 1 ri U. 1 1 la 11 oCLICLCU jjivJLCiii, uLiy 
TD "MO* S 


373 


100 


1075 


AAM25814 


Homo s aniens 


HYSE- Human protein sequence SEQ 
ID NO: 1329. 


69 


30 


1075 


AAM25290 


Homo sapiens 


HYSE- Human protein sequence SEQ 
ID NO: 805. 


69 


30 


1076 


gi7578595 


Mus musculus 


teashirt 2 


4570 


89 


1076 


gi7527470 


Mus musculus 


zinc finger protein 


2311 


48 


1076 


gil9481304 


Danio rerio 


teashirt-like zinc finger protein 


1785 


48 


1077 


gil4193749 


Mus musculus 


zinc finger 142 


165 


28 


1077 


gil510147 


Homo sapiens 


similar to Human zinc finger 


164 


28 



wo 
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S 


Percent 

irlfknH tv 
luciiii i.y 








protein(ZNF142) 






1 077 
1U / / 


m£Q7R09£ 

giuy / ouzo 


lvius museums 


zinc linger protein z-ijw^i 


1 61 
1 Ul 


9^ 


1 07R 


A AV^^A 
l\J\ I 040.3U 


xiomo sapiens 


uiio 1 riuniaii luman nomo 10 gy 

pi UtC ill. 


1 

7J 1 


yy 


1078 


AAY01635 


Homo sapiens 


ABBO Human PS214 derived 

puiypcpLK-ic. 


951 


99 


1 07R 


A A VSQ6R9 


T-Tr\TYi/"v comptio 
XXUlliU odpiOlio 


vJJj/O JL OCvlClCU piVJLCULl x\JO \J\jy ~> \J 

A 7 -FT 

^x^ _ X Xu, 


0S1 


QQ 

yy 


1079 


AAB36595 


Homo sapiens 


INCY- Human FLEXHT-17 protein 
<?eniienre SFO TD NO' 17 


1339 


100 


1079 




xx^jiiivj odtJit^iio 


IJ -J sLVJ 


1339 


100 


1079 


gil9353786 


Mus musculus 


expressed sequence AI837181 


1305 


96 


1 ORO 


A A"M"3QR1 5 


nomo sapiens 


11 x je- nuiudn poi^/pcpinic oJC/V^/ xjl' 
NO 2960. 


4S1 

H 3 1 


49 


1080 


gil4161367 


Homo sapiens 


megakaryoblastic leukemia- 1 protein 


451 


42 


iUoU 


gll4U4l04o 


xlomo sapiens 


rvTT MAT 


4D 1 


A 9 
4Z 


1081 


AAB36589 


Homo sapiens 


INCY- Human FLEXHT-1 1 protein 
sequence o±i^ lu iNw. 1 1. 


742 


100 


1 HQ 1 
lUo I 


A A nQ11 QQ 


riomo sapiens 


iNioL/- riuman protem xiJr i uo4 1 . 


7/1 9 


1 no 

1UU 


1081 


AAU28389 


Homo sapiens 


HYSE- Novel human secretory protein, 
oeq iu ino /^fo. 


742 


100 


1 0R9 

1UOZ 


A 1 

/ JjZI 


nomo sapiens 


xvvjoxi/ riuman secretea proiein 

sequence encoded by gene 34 SEQ ID 
TvJO- 1 40 

IN V_/. JLtv/. 


^07 


1 on 


1 OR? 


ai'}R7^79 1 


Vw^aCIlOllla D Ul 11 S 

F»l PCT 51 TIC 


T-71 1P1 1 
r i lull 


1 47 

It 1 / 


^9 


1 08? 


ail 1 55R746 


lV/Tnc TniicniliiQ 
1VXU.O lllLloO lllllo 


r*5i 1 CA/ntpniTi— 1 T\rnf pi ti 
L/CHo_yiiicixiii x pivJLCiii 


R5 


3R 


1083 


gil 8089024 


Homo sapiens 


Similar to three prime repair 

C/A-lJlXlXL/JLC'CloC' X 


3331 


100 


1083 


gil7227176 


Homo sapiens 


ATR-interacting protein 


3331 


100 


1 OR^ 


±\±\±^y oy o 0 


T-T<"\TYH"l CQtM PT1C 

nomo sapiens 


XJLJ_/JLvl - XJLUXilctll piULeill bCqUCilCC OJI/V^/ 

TD TsTO- 14000 


9Q1 0 


1 00 


1084 

J. UOt 


ei214105R7 


T-TfYtnn ct^ni pn q 
XXVX111U ocipicno 


similar to RTKFTsF rD"NTA 2^10041H0fi 


R1 R 

O X o 


100 


1084 


gi6855513 


Gallus gallus 


syndesmos 


549 


63 


1 0R4 


cri 1 RO^A^RR 


1V1US IllUSCUlUb 


s y I1U.C biiiu s 


^^7 


SR 


1 OR^ 


A AVKQARdd 


ITIOIIIO oapicno 


xxxj/Ijx- xiuman proiem sequence ony 
ID NO: 16020. 




1 00 
l \J\J 


1 OR^ 


A AT T1 ^Q^Q 

/V/\LJ 1 J7J7 


xiomo sapiens 


xiUiViA- riuman novel secretea protein, 
Seq ID 892. 


679 
O / Z 


1 oo 

1UU 


1 or<; 


rri90'*n6/179 


ivius mus cuius 


JviivxiiN cuiN/v Z4UUUUQ1U4 gene 


*\06 
DUO 


R1 
o 1 


1086 


gil2659138 


Mus musculus 


mage-d3 


144 


31 


lUou 


gll33 l04oo 


Mus musculus 


MA(jrJb-necain/tropnimn complex 
(iviagpninm J 


1 1 Q 

loo 


o 1 


1086 


gil2697573 


Mus musculus 


trophinin-2 


138 


31 


1 OR7 
lUo / 


gizoyDoz>y 


rsos taurus 


pyruvate aenyarogenase pnospnatase 
regulatory subunit precursor; PDPr 




RO 


1087 


2.14022085 


A/f P C Ci v\\ 1 'TT* iS i 1 1 TYl 
XVXCOtJlllXZjVJLJlU.111 

loti 


octxcusuie ticiiy vxnjgciicioc 


346 


37 


1087 


gil5075297 


Sinorhizobium 
meliloti 


PUTATIVE OXIDOREDUCTASE 
PROTEIN 


344 


39 


1088 


gil 0440476 


Homo sapiens 


FLJ00075 protein 


843 


98 


1088 


AAY35936 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 185. 


395 


100 


1088 


AAB64417 


Homo sapiens 


INCY- Amino acid sequence of human 
intracellular signalling molecule 


360 


100 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 


Percent 

irfpnfitv 








FMTP A AO 






1089 


gi538413 


Mus musculus 


zinc finger protein 


3110 


94 




gil oo / /4 


Homo sapiens 


zinc linger protein 


9479 


S6 




gi2739j 


Homo sapiens 


ZJNry 1L, 




S4 


1090 


AAG79225 


Homo sapiens 


GEMY Amino acid sequence of a 
numan r ovjjo- i Dinamg protein. 


1640 


99 


1090 


AAG79120 


Homo sapiens 


DAUS- Amino acid sequence of 
louipiox proiem. 


1640 


99 


1090 


gil4719305 


Mus musculus 


SNX20 


1241 


77 


1091 


AAB99495 


Homo sapiens 


AK±>vJ- Human LLAor-J protein 
sequence SEQ ID NO:2. 


0 / / j 


OQ 


1091 


gil4597912 


Homo sapiens 


numan CLAo-ro 


0 / / j 


QQ 


1091 


AAB99493 


Homo sapiens 


ARBO- Preliminary human CLASP-3 
protein sequence Fig l . 


6123 


99 


1092 


gi2425 111 


Dictyostelium 
discoideum 


ZipA 


1 A(\ 


9fk 


1092 


gi5 77221 


Saccharomyces 
cerevisiae 


— - — — - 

Imhlp 


1ZJ 


90 


1092 


gil0178157 


Arabidopsis 
thaliana 


contains similarity to neat snocK 
protein-gene id:MPO 12.20 


1 1 Q 


99 
ZZ 


1093 


AAB63245 


Homo sapiens 


LUUW- Human breast cancer 
associated antigen protein sequence 

otS^ 1JLJ JNvJ.OU/. 


l f ID 


1 nn 


1093 


gil81247 


Homo sapiens 


cyclin D3 


1514 


100 


1093 


gil5079591 


T T _ • 

Homo sapiens 


Similar to cyclm D3 


1^1/1 

ID 14 


1 r\n 


1094 


gi8926320 


Rattus 
norvegicus 


corneal wound healing related protein 


3606 


95 


1094 


gil6768554 


Drosophila 
melanogaster 


GM08606p 


1275 


38 


1094 


gil9528l51 


Drosophila 
melanogaster 


AT26759p 


970 


37 


1095 


AAB93319 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID JNCJ: iz4Uo. 


2776 


99 


1095 


AAB94051 


Homo sapiens 


HELI- Human protein sequence SEQ 
1U JNvJ: 14Zlo. 


2227 


100 


1 nnc 
1U9D 


AAB9U /lz 


Homo sapiens 


vjrrSJVl Y rluman duZ i y_i protein 
sequence SEQ ID 120. 


&9^ 


yo 


ll)9o 


AAB4J434 


Homo sapiens 


JnLUJViA- Human cancer associatea 
protem sequence any 117 rsvj.o /y. 






1096 


AAB57205 


Homo sapiens 


ROSE/ Human prostate cancer antigen 
protein sequence SEQ ID NO: 1783. 


453 


81 


1096 


AAY07039 


TT * 

Homo sapiens 


LUDW- Breast cancer associated 
antigen precursor sequence. 


440 


o4 


1097 


gil3543594 


Homo sapiens 


similar to rat myomegalin 


865 


98 


1097 


gi20072822 


Homo sapiens 


Similar to phosphodiesterase 4D 
interacting protein (myomegalin) 


838 


94 


1097 


gil9263586 


T 1" _ • 

Homo sapiens 


similar to rat myomegalin 


QIC 
ODD 


y 1 


1098 


AAM73511 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 33817. 


202 


100 


1098 


AAM6083 1 


Homo sapiens 


MOLE- Human brain expressed single 
exon probe encoded protein SEQ ID 
NO: 32936. 


202 


100 


1098 


gil 56262 


Caenorhabditis 
elegans 


collagen 


93 


35 



wo 



03/029271 



PCT/US02/30474 



250 
Table 2B 



SEQ 

xU 


Hit ID 


Species 


Description 


S 

atUI c 


Percent 

iiiciiii \.y 


lUyy 


X\DD 1 Jj / Z 


nonio sapiens 


xiuiVLfV- xiuiiiaii nervous sysiein 
related polypeptide SEQ ID NO 4029. 




OH 


1 uyy 


rrt 1 CI ,4C7Q^ 

giiz) ItO /yj 


oils scroia 


oasic proime-ncn protein 


1 JZ 


^9 


1 noo 
luyy 


rr-J 1 70/1 CO QO 

gll /y4DooZ 


urosopniia 
melanogaster 


t?T7 1 71 /^C-»-» 

Kx-l / 1 top 


i^tu 


1 1 

D 1 


1 1 HA 
1 1UU 


rr_9 A97 1 1 


ivius muscuius 


iiorous sneatn-mter acting protein l 


DDV 




1100 


gi7800648 


Streptococcus 
pneumoniae 


PspA 


110 


25 


1100 


gi3 171906 


Homo sapiens 


DIA-156 protein 


107 


25 


1 1 A 1 

1 101 


gll iyo99oU 


Homo sapiens 


d.Jzol(jrz3.z.l (novel protem, isoiorm 

i) 


1 HC\A 

1 /U4 


1 AA 
1UU 


1 1 A 1 


AAA A A A1 /CO 

AAM4y loo 


Homo sapiens 


OTT A XT Un«v%n« 1 _____ + ' „ on o/; 

onAN- Human Keratm 3U.3o. 


11/1/1 
1 144 


AO 

yo 


1 1 A 1 

1 1U1 


1 A 00070 

gii lyoyy /y 


rlomo sapiens 


uJzo ICjZJ.z.z (novel protein, isoiorm 
2) 


1 144 


OQ 

yo 


1 1 AO 

1 IUz 


A A "DO/1 C 


Homo sapiens 


rllil^l- rluman protein sequence oiiv^ 
ID NO: 15341. 


oooo 

zzy / 


QO 

yy 


1 1 AO 

1 IUz 


A TDD QO/£0 A 

ABBoyozU 


Homo sapiens 


XJT TA yf A T T *»rtUt«Q.«ti/4Q CCA TT". 

rlUJvlA- rluman polypeptide oiii^ 1U 
NO 1996. 


ol / 


yo 


1 1 AO 

1 IUZ 


A A T T1 AOQ 1 


rlomo sapiens 


rJUJJri- rluman aiaenyae/Ketone 
reductase 36. 


171 
1/1 


O/. 
jO 


1 1 AO 


rri 1 097£C7 1 

gll jZ / 00 / 1 


xiomo sapiens 


cnriu synaptotagrnin 


QO/1 


y© 


1103 


gi213111 


Discopyge 
ommata 


synaptic vesicle protein 


374 


33 


1103 


gil932801 


Rattus 

■ 

norvegicus 


synaptotagrnin X 


370 


35 


1 1 A/t 

1 1U4 


© Q/1 Q O O O 

glloo4o333 


Homo sapiens 


+. _. T) TtTXT «T\\T A A AO A/I COXTO/1 

bimilar to KiJ<LbJN CDJNA 4yiU4j3JNZ4 
gene 


1 O'/l A 

1 /40 


1 AA 
1UU 


1 1 HA 
1 1U4 


m* 1 OfM o o o^; 
gll oU4o3Z0 


ivlus museums 


Qimilor T> TT/'T.XT ^TYMA /IOOA/l^OXTO/1 

oimiiar to ivixvriiN cjjina 4yjU4j jjnz4 
gene 


1 /10 ^ 
14JD 


QO 
OD 


1104 


AAY73389 


Homo sapiens 


INCY- HTRM clone 4173 1 1 1 protein 
sequence. 


1023 


100 


1 lUJ 


/\/\ w h- / zuy 


nomo sapiens 


u UN i nomo sapiens tuDuim-ioiamg 

t/Uiat/lUI X5. 


1 Ut-Z 


OD 


1105 


gi2343185 


Homo sapiens 


tubulin folding cofactor B 


1042 


85 


1 1 OS 


cri1 ZL71 

gl lH- / 1 JU'tH- 


ivius muscuius 


rviJSJZ/iN cljl > /v Z'f i uuu / u i z gene 


you 


77 


1106 


gil 6356663 


Homo sapiens 


erythrocyte membrane protein 4. IN 


4359 


96 


1 1 n.£ 


gio /yuo^o 


ivius muscuius 


neuronal protein 4. 1 


zlOA^ 
^Z^f J 


QO 

y j 


1106 


gi4587118 


Rattus 
noi^egicus 


rat brain4.1(S) 


4233 


93 


1107 


gil6356663 


Homo sapiens 


erythrocyte membrane protein 4. IN 


4527 


100 


1 1 r\i 
i iu / 


_*-.0'7AAC/1 C 

gi3 /yUD4D 


Mus muscuius 


neuronal protein 4. 1 


/I O iC/1 

4304 


AC 

yD 


1 1 Cil 

1 1U / 


~;a c OT1 1 o 
gl4Do l\ lo 


Kattus 

• 

norvegicus 


rat bram 4.1(b) 


A O C 0 


AC 

yj 


1 1 no. 

1 1UO 


A AT5O00AO 

/v/vtsyj juy 


rlomo sapiens 


xiiii^i- rluman protem sequence orii^i 

H-J iNW.lZjOJ. 


9/17 
Z^ / 


9 K 

ZD 


1108 


gi4584539 


Arabidopsis 

tVia liana 


extensin-like protein 


194 


36 


1108 


gi330444 


Human 
herpesvirus 4 


nuclear protein EBNA2 


185 


37 


1109 


gil0179839 


Homo sapiens 


Thy-1 co-transcribed protein 


761 


100 


1109 


gi2 1070344 


Homo sapiens 


GAS2-related protein isoform beta 


100 


32 


1109 


gi2944066 


Cards familiaris 


synapsin I 


98 


32 


1110 


gil5144271 


Homo sapiens 


NYD-SP11 


2037 


61 


1110 


AA014415 


Homo sapiens 


UYNA- Human testis development 
specific protein- 1 1, NYD-SP1 1. 


1934 


64 
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ID 


Hit TD 

All I J~i-J 






s 

score 


Percent 
identity 


1110 


gil0178281 


Arabidopsis 

til a 1 1 a n a 


katanin p80 subunit-like protein 


113 


24 


1111 


gi 15 144273 


Homo sapiens 


protein kinase NYD-SP24 


1563 


100 


1111 


cn1 SR?1644 

gi 1 JO^w J U r r 


l-T r\ t-n r\ carrip'Tl Q 
llUlilxJ oCtjJlCllO 


Inner ■frnrn 

1U11H 1U1111 


1559 

i +j ~j y 


99 


1111 


gil8158217 


Homo sapiens 


calcium-response factor CaRF 


1559 


99 


111? 

1 1 1Z, 


A A O6707Q 


nuiiiu sapiens 


Dlw VV - 11 Ul Hall UlliyvllUUlLILcloC 

polypeptide. 


1 1 09 
1 1 \jy 


y o 


1 1 1 z. 


cril 1 QQ4713 
j^i 1 1 yy-x / 


A Y"ci1"\iHfYrici c 
.r\.l ctUlLUJpolS 

thaliana 


L/Uiiiaiiio oliiiiiaiiLj/ l\J i^cii w aii-^Jicioiiia. 

membrane linker 

trmtf^iTi^crf^ti p irl'lVfT^" A?^ S 

UHJLCHl J^CllC !U<lVll\^\Zf 


1 1 ? 


24 


1112 


AAB95856 


Homo sapiens 


HELI- Human protein sequence SEQ 
TD "NTO'1 RQ1 6 


105 


27 


111^ 

1 1 1 D 


A ARQ^R7? 


T-Tnmn carti/^nc 
jnnjniu odjjiciio 


T-TTh T T_ T— Ti i -m ^ n "nrnt^Mn cf^mif^n^f* ^IT^f) 

HCLfl 1 1 Ullltlll pUJlClll oCL^U.C'llL-C uljy 

ID NO: 13771. 


1110 

1 1 1 \J 


1 00 

1 w 


1113 


gil8146803 


Mus musculus 


neural-restrictive silencer factor 

111 SI/ ICS I 


104 


28 


1113 


gi560496 


Cercopithecine 
herpesvirus 1 


glycoprotein G (homologue of HSV-2 
US4) 


99 


31 


1114 


A ARQ4H4Q 


nomo sapiens 


riiiJLri- jTLUiiiaii proiem sequence odv^ 
ID NO: 14213. 




1 no 

1UU 


1114 


gil5080775 


Homo sapiens 


protein kinase NYD-SP5 


1562 


99 


111/1 


/VAlVi / Z / / 1 


riomo sapiens 


ivi KJL,rj- riuman Done marrow 
expressed probe encoded protein SEQ 
ID NO: 33077. 


Zjy 


1UU 


111^ 
1110 


gizuu / zyo i 


riomo sapiens 


similar to zinc linger protein irom gene 
ui unuei Lain cxuii bLrucLure ? siimiai lu 

Q99676 (PID:g3025333) 


z / oo 


QQ 


I 1 1 S 

I I u 


cn"41 SQRRR 

gl'r 1 J2/000 


T-T/-*tyi/-\ conipTio 
XT.U1IHJ bctpiClla 


vmo "fin CTG*ir rxrntpin ■ft'/^Tn rrA^Tif^ 
ZilllL' llllgCl jJlULClll 11U111 g,CHC Ul 

uncertain exon structure; similar to 
OQ9676 /'PTD*p30?5333'I 


?? 1 ? 


00 


1115 


AAM68363 


Homo sapiens 


MOLE- Human bone marrow 

CApiCo&CU piUUC C11L/UVJ.CL1 UlULClll Ul_/V^ 

ID NO: 28669. 


1947 


100 


1116 
1 1 1 \j 


cri 1 1 464740 

gl lit U*+ / Hv/ 


XlUillU SdpiCllS 


HlgctAUlllll 


^160 

D 1 Uu 


1 no 


1116 


AAG74478 


Homo sapiens 


HUMA- Human colon cancer antigen 

nrntpin WO TD TsJO-S?4? 
|ji \j iciii oxjv^ l±j in . »J Z*tZ , 


757 


97 


1116 

1 1 1U 


A ACt761 SQ 


11U111U odUlCllo 


T-TT TA/T A _ Hnmtin p/iI^ti nnti hf*v "antiorpTi 
1 1 1V1.ttl— XlLlllldll UU1U11 L/CtllOCl dlltlliCll 

protein SEQ ID NO:6923. 


640 


QQ 

yy 


1117 
111/ 


A ATT1 


ntJlllU acl^JlCllS 


T-TT TIV/T A _ T-TiiTYici'ti nrti/pl cppfptprl nfrit'piTi 

11U IVluTV XHlllldll 11UVC1 dCLlClCU UlULClil, 

Seq ID 804. 


?^0^ 

A. -J \J ~J 


QQ 

yy 


1117 
ill/ 


A ATT1 6^ 1 ? 


nomo sapiens 


rnjivi^v- nunidn novel secreieu proLcm, 
Seq ID 1265. 


1 606 


Q7 


1117 




riomo sapiens 


olio i numan secreiea proiem s ony 
ID NO: 6135. 


67R 
O / o 




1118 


gil7390199 


Homo sapiens 


Similar to zinc finger protein 202 


1665 


99 


I 1 1 R 

I I lo 


A A V")OQ 1 Q 

AA i Zyy 1 o 


riomo sapiens 


ivi y -Ki- riuman urUJ i protein encoaea 

Viv frfln sprint pDNA? 


744 


JO 


1118 


AAY29917 


Homo sapiens 


MYRI- Human CHD 1 protein encoded 
by transcript cDNAl . 


744 


38 


1119 


AAA27125 
aal 


Homo sapiens 


UNII Human repressor of estrogen 
repressor activity (REA) cDNA. 


1299 


89 


1119 


ABB11913 


Homo sapiens 


HYSE- Human B-cell receptor 
associated protein homologue, SEQ ID 
NO:2283. 


1299 


89 


1119 


AAY94443 


Homo sapiens 


UNII Human repressor of estrogen 


1299 


89 
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SEQ 
ID 


Hit ID 


Species 


Description 


S 

c<*nt*A 
atUI C 


Percent 








repressor activity (REA) protein. 






1 120 


AAG93283 


Homo sapiens 


JNlbC- Human piotem rlr* 1 uozo. 


1 1 Q 


1 nn 


1120 


gil5559443 


Homo sapiens 


bimilar to KlKLJN cdjna zo 1U41 /j iz 
gene 


1 ly 


1 

1UU 


1120 


gil9354196 


Mus musculus 


RIKEN cDNA 2810417J12 gene 


500 


71 


1121 


AAB58852 


T T * 

Homo sapiens 


HUMA- Breast and ovarian cancer 
associated antigen protein sequence 

kb±i^ IU DDK). 


311 


1 AA 


1121 


AAM39029 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

JNvJ Zl /4. 


75 


30 


1121 


gi6899807 


Homo sapiens 


zinc finger protein 219 


75 


30 


1 1 O 1 

1 122 


gi809934o 


Homo sapiens 


zinc linger protein 


ID /o 


3Z 


1122 


AAM00913 


Homo sapiens 


HYSE- Human bone marrow protein, 

Cun TT» "MTV ICQ 


1369 


62 


1 122 


AAJVl/o9o3 


Homo sapiens 


±1 Y on- riuman pro tern oiiv^ id in u 




oz 


H23 


AAG02740 


Homo sapiens 


GEST Human secreted protein, SEQ 

iU ri\J. OoZl. 


458 


98 


H23 


gi2951763 


Sugarcane 

■ « 

mosaic virus 


caps id protein 


78 


26 


H23 


gil249618 


Johnsongrass 

• • 

mosaic vrrus 


coat protein 


78 


26 


1 1 1/1 

1 lz4 


1 C\A C3Q71 

giiy4ooy / 1 


rvlus musculus 


isJJsJiJN cuxna jzjU4U11U1 gene 


Zj 1 D 


OJ 


1124 


gi2576348 


Homo sapiens 


Glutamyl tRNA synthetase 


1520 


100 


1124 


gi2 10643 93 


Drosopnila 
melanogaster 




111^ 

1113 


A Q 

4JS 


1 1 1 c 

1 125 


AAB82047 


TT * „ 

Homo sapiens 


IGAK- Human mast cell surface 
antigen. 


11/11 
2141 


1 AA 


1 1 1 c 

1125 


gil2654783 


TT. _ ■ _ 

Homo sapiens 


Similar to loss of heterozygosity, 11, 
chromosomal region 2, gene A 


1 1 /i 1 
Zl41 


1 AA 


1 1 1 c 

1 125 


gi2 190974 


T T - ~ 

Homo sapiens 


breast cancer suppressor candidate 1 




1 AA 


1126 


AAB95305 


Homo sapiens 


HELI- Human protein sequence SEQ 

ID JNU: 1 /j>44. 


3327 


99 


1126 


AAB92704 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 11 106. 


2993 


99 


1 1 1 ZT 

1 126 


A A Tt A j| 1 T7 

AAB94137 


T T * 

Homo sapiens 


HELI- Human protein sequence SEQ 

tta TvT/^V. 1 A ACk A 

ID INvJ. 144U4. 


z944 


1 AA 


1 1 T7 
1 IZ 1 


gll4oolUD(J 


Ormthorhynchus 
anatinus 


sultotransierase oUlvl 1A 


to 


jy 


1 1 n 

1 iz / 


rt'CAO 1 /CI A 

gOUoloz4 


Emericella 
mctuians 


mo lyo clop term syntnase large suoumt 

L^nXrl 


/U 


D 1 


1127 


gi|21292708 

lrrVilTJ A A C\A Q 

|gD|ii/\AU4o 

53.1| 


Anopheles 
gamoiae sir. 
PEST 


agCP4275 


80 


29 


1 1 OS 
1 IZo 


glD4D /oUD 


riomo sapiens 


qjiuodl/z.o ^pairea dox gene i ) 




1 nn 


1128 


gi9501358 


Mus musculus 


Paxl protein 


1681 


90 


1 1 9S 




A/lnc tnnopiiliio 
IVlUb IIlUbL-UlUij 


pAVI 
rAAl 


1681 


90 


1129 


gil 8642466 


Homo sapiens 


EMSY protein 


1915 


99 


1129 


AAW82571 


Homo sapiens 


CANC- Human BBP1 DNA. 


1825 


95 


1129 


AAM93627 


Homo sapiens 


HELI- Human polypeptide, SEQ ID 
NO: 3466. 


623 


100 


1130 


AAY10834 


Homo sapiens 


HUMA- Amino acid sequence of a 
human secreted protein. 


1603 


99 


1130 


AAB93801 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 13573. 


1597 


99 
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SEQ 

Tip* 


Hit ID 


Species 


Description 


S 

score 


Percent 
identitv 

tvl V/ il Arm %* J 


1130 


gil 7946001 


Drosophila 
meianogasier 


RE50056p 


254 


31 


1 1 1 1 

1 1 D I 


A A 1 1U534 


xiomo sapiens 


Xl LJ lVX/V* zAIIllIlU ctL-lU. oCC[UC11UC \JL Cl 

liniriiin cf a r»'t*f : *i"prl trm'i'Pin 

llLlilla.ll oCLICICU JiJUJtt/lll. 


1565 


100 


1131 


AAB93801 


Homo sapiens 


HELI- Human protein sequence SEQ 

XJ-/ 1 > w . 1 J J / J. 


1559 


99 


1131 


gil7946001 


Drosophila 
meianog<±b icr 


RE50056p 


253 


31 


1132 


gil 504002 


Homo sapiens 


similar to a human major CRK-binding 
protein DOCK180. 


9481 


99 


1 132 


gllJ19514/ 


Mus museums 




0097 


Q4 


1132 


gil339910 


Homo sapiens 


DOCK180 protein 


5541 


62 


1134 


gi3600087 


Dictyostelmm 
disco ideum 


cytosolic legulator pianissimo 


yj 


j't 


1 134 


gi341 1250 


Dictyostelium 
discoideum 


aevelopmentai protem 


yj 


1.A 


1 i O/l 

1 134 


• 1 fAI 1 Q AO 

gilDUl liUo 


Human 

immunodeficienc 
y virus type i 


envelope giycopioiem 


oz> 


97 


1135 


gil3543686 


Homo sapiens 


Similar to RIKEN cDNA 4931428F02 
gene 


1483 


98 


1135 


AAY07081 


Homo sapiens 


LUDW- Renal cancer associated 
anugen precuisor sequence. 


255 


40 


1135 


AAE13761 


Homo sapiens 


CORI- Human lung tumour-specific 
protein lioo-i^. 


254 


37 


1136 


gil4250138 


Homo sapiens 


Similar to RIKEN cDNA 573042 1E1 8 
gene 


1377 


100 


1136 


gil8605826 


Mus musculus 


Similar to RIKEN cDNA 5730421E18 
gene 


887 


91 


1 1 l£Z 

1 l3o 


A A T TOH/I 1 Q 

AAUZU4 iy 


xiomo sapiens 


xtuivla.- xiuman secreceo proieui, ocq 
TT1 "NTo 4 1 1 


S^O 




113/ 


A A T TOfl/1 Q C 


xiomo sapiens 


T-TT TN/T A T-TiTmon c*= , r"T"f a tf a rl i^TT^tf^i n Qpn 
11 LJ n.U.XIlall oCOIClCU JpiUlC-llIj OCCJ^ 

ID No 480. 


643 


83 


1 1 in 
113/ 


A A A/TQQ COO 

AAJV1 y 3 DZZ 


xiomo sapiens 


ririivi- numan poiypeputie, oc-v^ llj 


LIT - J 


OJ 


1137 


AAY57916 


Homo sapiens 


INCY- Human transmembrane protein 


643 


83 


1138 


gi6934248 


Homo sapiens 


tropomodulin 4 


1536 


89 




rril 01 AAl f>D 

gllZ / 44 / oz 


xiomo sapiens 


IllUbL-iC lIUpUlllVJU.U.1111 


1536 


89 


1138 


gi6013189 


Homo sapiens 


Sk-tropomodulin 


1530 


89 


1 1 1Q 

i i3y 


AArS43yUU 


riomo sapiens 


rTUivi/\- jnuman cancer dobocicticu 
protein sequence SEQ ID NO: 1345. 


516 


89 

0.7 


1 1 'JQ 

i i3y 


gilooo/UZ 


ivius muscuius 


riDOSomai proiem 


S16 


89 


1 1 io 
1 13y 


minion 
glzUo /3Z 


Kattus 
norvegicus 


riDOSomai proiem i_/joa 


516 


89 


1 1 /i n 
1 14U 


AACjrUUoo4 


Homo sapiens 


Oxio i numan secretea proiem, oxi^ 
ID NO* 4745 


619 


100 


1140 


AAG00663 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 4744. 


489 


100 


1140 


ABB04721 


Homo sapiens 


SHAN- Human PP2464 protein SEQ 
ID NO: 17. 


89 


26 


1141 


gil4250752 


Homo sapiens 


Similar to hect domain and RLD 2 


890 


100 


1141 


gil6648386 


Drosophila 
melanogaster 


LD39062p 


347 


31 


1141 


gi4079809 


Homo sapiens 


HERC2 


288 


32 



wo 



03/029271 



PCT/US02/30474 



254 
Table 2B 



SEQ 
ID 


Hit ID 


Species 


Description 


o 

score 


Jrercent 
identity 


1142 


gil5277241 


Homo sapiens 


Testis expressed gene 


485 


90 


1142 


gi3 176438 


TT * 

Homo sapiens 


HCG V 


/IOC 


90 


1142 


gil 1322969 


Homo sapiens 


protein phosphatase 1 , regulatory 
(inhibitor) subunit 1 1 


485 


90 


1 143 


AAB64461 


Homo sapiens 


HUM A- Human secreted protein 
sequence encoded by gene 40 SEQ ID 
NO:99. 


272 


100 


1143 


AAB75511 


Homo sapiens 


ROSE/ Human secreted protein 
sequence encoded by gene 6 SEQ ID 
NO:65. 


272 


100 


1143 


AAB51649 


Homo sapiens 


ROSE/ Human secreted protein 
sequence encoded by gene 30 SEQ ID 

\Tn.on 


272 


100 


1 144 


AAB43o2 / 


Homo sapiens 


HUJVIA- Human cancer associated 
piotein sequence oJii^ iu jnu. iz /z. 


CI c 

7> 1 J 


Q1 


ii/i/i 
1 144 


' 1 "7A/1 A C\1 A 

gil / 944104 


Drosopnila 
melanogaster 


KrSUZZyZp 


A C\C\ 




ii/i/i 
1 144 


A A T> /ZA 1H A 

AABo4J /4 


Homo sapiens 


1JN C Y - Amino acid, sequence 01 Human 
intracellular signalling molecule 

1JN 1KAO. 




90 

zy 


1 140 


1 /l C/l QOA'7 

gll4D4yZU / 


Homo sapiens 


T TT? A Y1 
UKAA1 


*X 1 1 Q 
Olio 


1 nn 

1UU 


1 1 A C 

1 14j> 


gil Iy\j 1 /yj 


Homo sapiens 


ivrJ^-ueta mauceu apoiosis protem d 


"5 1 1 C 


i nn 

1UU 


1145 


AAM51815 


Homo sapiens 


BADI Human apoptase LI 00. 


3117 


99 


1 1 A 

1 14o 


gil jyb 19M 


Yersinia pestis 


L-seryl-tRNA selenium transferase 


O/ 


4 / 


1147 


AAG74155 


Homo sapiens 


HUMA- Human colon cancer antigen 

,_ „ i ■ O T7 TF\ \TA./lfl1 n 

protein oni^ ID JNvJ:4yiy. 


1118 


98 


1 1 /I *7 

114/ 


a a "q /no 
AAd jJ4Zy 


Homo sapiens 


HUJVIA- Human colon cancer antigen 
protein sequence l±j i\\j,yoy , 


1 1 1 Q 
1 1 1 O 


OQ 


1 1 47 


gl 1 7jJjO / 3 


lYius muscuius 


jviJSJj/iN cj-/iN/\ zziu^t-i /IJU7 gene 


yuj 


/ 3 


1 1 48 




ivius muscuius 


MHO C 
IN VjZo 




7^ 
/ J 


1148 


AAM38700 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

IN w io'tJ. 


753 


42 


1 1 AR 

1 l^f-o 


/\/\lVlJ> 0O77 


n omo sapiens 


n i oxZ/- m urn till poiypcp liqc any iu 
INTO 


740 




1 14Q 




Ayfnc miicpiilnc 
iviua IIIU.0CU.IU.0 


XIJ.ULL11.1C1 _) ^lViJJ y 


O 1 C 1 




1149 


gi2266988 


Mus muscuius 


M33 polycomb-like protein 


2161 


82 


1 1 AQ 


glJOOU 1 OJ 


yvenopus lae vis 


roiyconiD nomoiog rci 






1150 


gil5213542 


Homo sapiens 


NSD1 


1618 


100 


1 1 

1 IDU 


■ -1 /-^7 c 1 o 
gllo/Mzo9 


Homo sapiens 


androgen receptor associated 
coregulator Zo / -b 


1014 


1UU 


1150 


gi3329465 


Mus muscuius 


NSD1 protein 


1455 


90 


1 1 C 1 

1151 


AAU75614 


Homo sapiens 


BIO W- Human RS3 protem 18. 


yzz 


1UU 


1151 


gil2005728 


Homo sapiens 


GLOlz 


9zz 


1 AA 

100 


1151 


AAU75615 


Homo sapiens 


BIOW- Human RS3 protein 18, N 
terminal peptide. 


78 


100 


1 1 CO 

1 1 52 


A A T TO O^n 

AAUozUl/ 


Homo sapiens 


INCY- Human secreted protein 

OJ3V^lT *+ J . 


Z404 


o / 


1152 


gil 86767 16 


Homo sapiens 


FLJ00257 protein 


2356 


84 


1152 


AAM00753 


Homo sapiens 


HYSE- Human bone marrow protein, 
SEQ ID NO: 116. 


1640 


91 


1153 


gi6572156 


Homo sapiens 


dJ1014D13.3 (novel protein) 


1783 


100 


1153 


gi6572155 


Homo sapiens 


dJ1014D13.2 (novel protein similar to 
ACTN3 (actinin, alpha 3)) 


1023 


100 


1153 


gil 8676484 


Homo sapiens 


FLJ00 139 protein 


704 


35 


1154 


AAG03284 


Homo sapiens 


GEST Human secreted protein, SEQ 


364 


100 
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ID 


xllt 1JJ 


opecies 


U ca CI ip ll U 11 


s 

score 


Percent 
identity 








1JJ INV^J. / jOj. 






1154 


gi4778 


Saccharomyces 

* * 

cere vis lae 


Usol protein 


119 


22 


1 154 


gl28533Ul 


Homo sapiens 


• 

mucin 


107 


20 


1155 


AAM25280 


Homo sapiens 


HYSE- Human protein sequence SEQ 

1U iNU. / 7J. 


1908 


100 


1 1 c c 

1 1 J J 


gll933ooyZ 


jvlus museums 


iN-aceryigiutamate & ynind.be 


1780 


92 


1 1 f f 

1 1 JJ 


AAr>5o4U5 


Homo sapiens 


f~\ O "II? / T finrr r^dno^i* Q c c/"\f*i Ci "f^fi 
KUoH/ XjlXIlg UalK/CI aooUUlciLCU. 

pOiypCpilUC oCLJLICII^C 0-Lj\/ XJL>' / *T J . 


814 


99 


1156 


AAW21949 


Homo sapiens 


NEWE- E6-binding protein E6- 

RP9D7 


1678 


100 


1156 


gi469885 


Homo sapiens 


EF-hand protein 


1678 


100 


1 1 C/£ 
1 156 


gll343ol jz 


Homo sapiens 


retlCUlOCalDlll Z., rJ,r-IlailU. l^alL-lUllJ. 

binding domain 


1678 


100 


1 1 C7 

115/ 


giloo /0054 


Homo sapiens 


Jri^juuzzo proiem 


JU7T 


99 


1 1 

115/ 


giizy3oy3 


ivius museums 


leucine zipper piuieiii i 


4324 


80 

uu 


1157 


gil9880160 


Mus musculus 


leucine zipper motif-containing protein 


4295 


80 


1 158 


AAB94413 


Homo sapiens 


rxxii-fi- numan proiem sequence ony 
ID NO: 15003. 


1 044 


94 


1 1 CO 

1 158 


gilZo(J43o5 


rlomo sapiens 


H i uuz proiein, nypenenbion-reidLeu 
calcium-regulated gene 


1 044 


94 


H58 


AAB93000 


Homo sapiens 


HELI- Human protein sequence SEQ 

1 JJ IN LJ . 1 1 ID 1 . 


1038 


93 


1 159 


gi33 87790 


Homo sapiens 


"DTD 1 

F1K1 


1 407 


OVJ 


1 159 


gil2653157 


rlomo sapiens 


dual speciricity pnospnatase 1 1 
(RNA/RNP complex 1 -interacting) 




7Q 


1159 


AAG01376 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID JNw: 545 /. 


542 


100 


1160 


AAM88829 


Homo sapiens 


HUMA- Human 

irnmune/haematopoietic antigen SEQ 

1U JNU: 104ZZ. 


1017 


94 


1160 


gi5901688 


Mus musculus 


GRIN1 


251 


24 


1 160 


gi5650774 


Gallus gallus 


interacting protein 6 


1 R^i 
loO 


jZ 


1161 


AAM67381 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 

1JJ Nkj: Z /OO /. 


814 


100 


1161 


AAM54995 


Homo sapiens 


MOLE- Human brain expressed single 
exon prooe encouea. protein oriv^; lu 
NO: 27100. 


814 


100 


1161 


AAU 16451 


Homo sapiens 


HUMA- Human novel secreted protein, 

0~,r« m 1 A A/I 

beq 11J 14U4. 


630 


100 


1162 


gil3661556 


Homo sapiens 


extracellular glycoprotein EMILIN-2 
precursor 


1410 


98 


1162 


gi2 1434745 


Mus musculus 


basilin 




/ 1 


1162 


AAM99923 


Homo sapiens 


HUMA- Human polypeptide SEQ ID 
NO 39. 


1096 


93 


1 163 


AAM /5 bZ / 


rlomo sapiens 


ivivji^xi- Human none marrow 
expressed probe encoded protein SEQ 
ID NO: 36133. 


648 


100 


1163 


AAM63016 


Homo sapiens 


MOLE- Human brain expressed single 
exon probe encoded protein SEQ ID 
NO: 35121. 


648 


100 


1163 


gi46363 


Rhodospirillum 
rubrum 


URP2 


88 


25 


1164 


AAU76957 


Homo sapiens 


CYTO- Novel human kinesin motor 


4649 


100 
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SEQ 
ID 


Hit ID 


Soecies 


Dpsrrintinn 

X»t3L 1 1 1 J 11U11 


i S 
score 


Pprppnt 

identity 








orotein HsKit)3d 






1164 


gi2 1104476 


Homo sapiens 


OK/SW-CL 108 


4637 


99 


1164 


AAU76958 


Homo s aniens 


CYTO- TsJovel lnimaTi lcine^in motor 

> JU X W X i V-* V vX XI L4-A A ACl aa aVXXXV*" O XXX XXXV LwX 

protein, HsKip3d motor domain. 


1 1835 


100 


1165 


AAE04764 


Homo s aniens 

-i-A-V^'AAA^' kJML/xVAlk/ 


INCY- Human vesicle trafficking 
protein-7 (VETRP-7) protein. 


2533 


100 

X wv/ 


1165 


gil5186738 


Mus musculus 


Tac2-N 


1 2226 


87 
o / 


1165 


AAU 19746 


Homo sapiens 


HUMA- Human novel extracellular 

matri'y TYroteiTi Sph TD Nn ^Q/i 


998 


100 


1166 


gi2760351 


Girardia tigrina 


myosin heavy chain 


197 


23 


1166 


ei3986194 


T")iicrp^ia iaixonipa 

X^ IXc^t'OXC* j <XVJ\J Ll.l\jCX 


tttv/ociti Tip aw pTiain 
1 Lxy UollxllCJclvy w lxclll.1 


1 1 Q7 
j ±y / 




1166 


gil8652658 


Schmidtea 

tried itPTrari pa 

lllvUl t V^X X ClXXw CL 


myosin heavy chain A 


179 


22 


1167 


A A Y5 6021 


Homo sapiens 


VLAA- Human CD40 receptor 

intpTflptino nrnfpiti A C~*A 
ilxlCIa.L'LlIlg jJiLllCxll t T^ t t. 


2121 


99 


1167 


AA012585 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 76477 


548 


96 


1167 


AAM40573 




n. i on- XAUiiiaii poiypepnac 0x1^/ xjl/ 
NO 5504. 




99 

ZZ 


1168 


gi20 126671 


Rattus 

norvpcicn <? 

llvl V vglvUu 


socius 


1971 


78 


1168 


gi 18044095 


Mus musculus 


Similar to RIKEN cDNA 4930506L07 

o-frip 


1199 


82 


1168 


eil3195163 


Rattus 
norvegicus 


<?troTTia1 anticrpn ^ 




4^ 


1169 


gil 8072031 


Homo sat>iens 


!7inc fiiiO'pr nrntpin ^^?R 

Z-i IX J. w XXlJ.&t'X L^X Lwlll O 






1169 


gil4456629 


Homo sapiens 


dJ54B20.2 (novel KRAB box 

pfititainino" f ^TT7 tA/np fzinp "fincrf^r 

protein) 


220 


52 


1169 

J. A. v-*' 


AAOH585 

xxx». Vw/ x i *^ Uw^ 


XAW1±1W> Od^'ICllO 


III OJ_»- XxUilxaxx pUiypcpiltlC OJ_yV^ 11^ 

NO 25477. 






1170 


gi20381091 


Homo sapiens 


similar to rab 1 1 -binding protein 


85 


26 


1170 


ei257041 


z^cd iJ.ia.yo 


iiyuioxyproiinc--ricii giycoproi6iii ? 
HRGP 






1170 


gi4007865 


Zea mays 


Hydroxyproline-rich Glycoprotein 

^JTlXvvJX J j 


83 


23 


1171 


gil 1415020 


Rattus 

I1UJL VCglUUb 


calcium channel alpha- 1 -I subunit 


70 


28 


1171 


gi| 18593484 

Irefivp 00? 
855 11 


Homo sapiens 


similar to interspersed repeat antigen 1 


73 


32 


1171 


ei!20904051 
IreflXP 139 

1 A. w A 1 kl A w' 

476.1| 


A/TiiQ tniicrMiliic 

IViUa lIJ.L4.ol/ UllXb 


oiiiiiiai lu v LiiLa^c-u.cpciiLicnx. x ~iypc 

palpiiim pVinnnpl n1nV»^-1T QiiVMini+ 
clival v^llclllilCl ctipiia- 1 1 bUL/LlIlil 

(CAVT.3) 


/ 1 




1172 


AAB94634 


Homo ^anipriQ 


WPT T- T"Ti 1 Tna ~n ■nfni'Pi'n cpnnpn cf* ^sT^O 

X X J_/XjX XXLiilXdXl LJlUlV^lll oCUUCliLC OijV^ 

ID NO: 15508. 


1 144 


47 


1172 


AAW52187 


Homo sapiens 


TEXA Human BRC A 1 -associated 
protein (hBRAP) primary sequence. 


1144 


47 


1172 


gil 0442700 


Homo sapiens 


zinc-finger protein ZBRK 1 j 


1144 


47 


1173 


gi21410587 


Homo sapiens 


similar to RIKEN cDNA 23 1 004 1H06 


528 


100 


1173 


gi6855513 


Gallus gallus 


syndesmos ] 


414 


64 


1173 


gil 9343886 


Mus musculus 


Similar to RIKEN cDNA 1 1 10001K21 
gene 


407 


60 


1174 


AAU18886 


Homo sapiens 


HUMA- Novel prostate gland antigen, 
Seq ID No 185. 


370 


97 
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ID 


Jtilt ID 


Species 


Description 


c 

score 


identity 


1174 


AAM96038 


Homo sapiens 


HUMA- Human reproductive system 
related antigen orA^? id inu. 4oyo. 


370 


97 


1 1 "7 A 

11/4 


glliJ lU/oZ 




Mus musculus 


• 

myoneunn 


Zj 1 


79 


1 1 *7 C 

11/5 


'H 1 AC /I OA 

gizl llo4oU 


Kattus 
norvegicus 


embryo -relate a protein 


IjjO 


Q7 

y^ 


1 1 7< 

1 1 / 0 


giozyzzuj 


urosopmia 
melanogaster 


i-/D4ZUJOp 






1 17c 


AArSDooyy 


nomo sapiens 


nuiviA- jjieast ana ovarian cancer 
associatca dnngen proicui bccjuciiL/C 
Qpn TD 607 




QQ 


1 1 76 


cri907R005R 


1 HJJ.11VJ od[JlCllo 


^[iTnilar tA T^R AA/T-1 ArAtfin 


2636 


99 


1176 


gill558109 


Homo sapiens 


adaptor molecule- 1 


2572 


90 


1 1 76 
ll/O 


rri 1 7Q7 87^7 

gl IjVj ojj / 


nomo sapiens 


Similar to PTRTFTM pDTsJA 061 DO^OWl 1 
ollllllar lO XVIIVJJ/IN ClvlN-rt. UDlUUjUni 1 

gene 




oo 


1 1 77 
11// 


A AT3Q771 1 
AAJoVj /ll 


nomo sapiens 


jnxiJUl- iimiiaii proiem oct^uciit'C oiz^v^ 
ID NO: 13305. 


7044 


1 00 


1 1 77 
11// 


glZ l^Z / JOo 


nomo sapiens 


AinaDascdn {j\^ixj uLaiiocript vd.rid.iiL *+ 


7074 

JUJT 


00 


1 1 77 
11// 


rri9 1 A97^S6 
glZ l^fZ /350 


nomo sapiens 


/vinaodscan oL-iu udiiboripL vdridiii o 


7074 




1 1 75 
ll/o 


rri 1 Q/^7£Azl6 
gll oO / O^fHO 


riorrio Sapiens 


x^i-fjuuizu protein 




1 00 


1 1 75 
ll/O 


rri 1 70A69QQ 

gii /u^+ozyy 


riomo sapiens 


v^AJKJL'-coniaiiiiiig iv.Lft.vjLj js» proiem 
CARMAl 




1 00 

1 \J\J 


1 170 


A A T TO 1 OH7 
AAUUlzU / 


nomo sapiens 


lviijui-j- numan caspase recruiimeni 
domain, CARD-11 polypeptide. 


1 000 
i yyy 




I 1 70 

I I /y 




nomo sapiens 


oAvjta Ammo acia sequence oi xne 
numan iype-1 memordne proiem. 


60 1 
oy i 


1 00 
1 \J\J 


1179 


gi4583677 


Homo sapiens 


p24B protein 


691 


100 


1 1 70 
1 1 /y 


gn oH-yuiuy 


nomo sapiens 


integral type i protein 


6Q 1 

oy i 


1 00 
1 uu 


1180 


AAB47327 


Homo sapiens 


CURA- FCTR4. 


3068 


99 


1 1 on 
1 loU 


A AT31 Q1 1 0 

AAH 1 0 1 1 0 


nomo sapiens 


rviii^iv- numan guanine nucieotiae 
exchange factor (GEF) 32529 protein. 




QO 

yy 


I 1 Qfl 

I I oU 


Ar5.t>U0Uj 1 


nomo sapiens 


v^LJivix^- numan ino protein sequence 
SEQ ID NO: 143. 


9Q69 

zyoz 


Q7 

y / 


1101 
1 1 0 1 


m' 1 Cl&'Z&A QA 

gllUO.D04o4 


nomo sapiens 


poiygiutamme-coniammg proiem 


A1 07 


1 oo 


1181 


AAG81247 


Homo sapiens 


ZYMO Human AFP protein sequence 
cth o rn "Mo* 1 9 


1180 


100 


1181 


AAM78881 


Homo sapiens 


HYSE- Human protein SEQ ID NO 
1543. 


1072 


44 


1 1 00 

1 182 


AAA A A A7 O A 


Homo sapiens 


MYlSxi- Human polypeptide c>Jbv^ ID 
NO 3465. 


1 07 
1UJ 


9/1 
Z4 


1 182 


A A "\ >T A AO 1 A 


Homo sapiens 


HYbE- Human polypeptiae bbC^ ID 
NO 3464. 


1 A7 
1UJ 


7 /I 

Z4 


1100 
1 182 


AAB93849 


Homo sapiens 


HELI- Human protein sequence SEQ 
IDNO:13697. 


yo 


7 ^ 

ZD 


1 183 


A AB 74944 


Homo sapiens 


YAM A Human ADAM type metal 
protease MDTS 1 protein SEQ ID 
NO:l. 


1 AA 

iuy 


z / 


1 1 R7 


A A RR6Q4Q 

AADooy^ty 


nomo sapiens 


nurr nunidii uicidiiupiuLCaoc ivur i o- 

19 protein. 


109 


11 


1183 


gi8977890 


Homo sapiens 


ADAMTS7, alternatively spliced 
product 


109 


21 


1184 


AAB94379 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 14928. 


3306 


99 


1184 


gi20072682 


Mus musculus 


actin-related protein 8 homolog (S. 
cerevisiae) 


3241 


97 


1184 


AAG66735 


Homo sapiens 


UYFU- Actin41. 


1960 


99 
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Description 


S 

score 


Percent 
identity 


1185 


gil7066106 


Homo sapiens 


Novex-3 Titin Isoform 


10131 


99 


i i o;: 
1 I OD 


A a pni^^n 

/\/Vjj/U4j>OU 


xiomo sapiens 


rvrr^v tinman Vinasp /'PKTNVl 

II^JV^ I STL Ullla.il JvllldoC/ ^JTl^JLJ.^ ) x . 


2456 


100 


1185 


AAU 17981 


Homo sapiens 


HUMA- Human immunoglobulin 

pUiypcpilLlC OJuy 1 \-J 1>LI 1 jLKJ . 


1345 


99 


1186 


gil3276231 


Homo sapiens 


FYVE and coiled-coil domain 

L/UIlLd.llllllg X 


7470 


100 


1186 


gil 8250726 


Mus musculus 


FYVE and coiled-coil domain 
containing i 


5550 


75 


1 loo 


gil jyJolyj 


xlomo sapiens 


r i vjj ano conea-cou uorriaiii 

L-UXlLallllllg 1 


1 971 

X £t / 1 


98 


1 1 QH 
1 1 O / 


A ATlOyl^Qn 

AAoy^oyu 


riomo sapiens 


XJTh T T TTnmon nmtpin QpniipriPf* SIT^O 
xxxZiX^iX XlUlllali |J1 kJ LCiIl oCl^uciJlvL OJuy 

TD NO-1 S^98 

ll_y l\U.lJJ7Ui 


1290 

x j—f y \j 


100 

X \J \J 


1 1 9.1 
I 1 o / 


en 1 ^0^81 OS 

gl 1 J7J O 1 "J 


Iavjiiuj od^Jiciio 


"R*Y"VP anH roiled-coil doTnaiti 
contaitiinff 1 


1263 


99 


1 1 87 

J. 1 o / 


ail 8250726 


"fVfnc mil renins 


FYVE and coiled-coil domain 
containing 1 


847 


70 


1188 


eil235527 


Homo saoiens 


myoglobin 


809 


100 


1 188 


eil5778933 


Homo sat)iens 

JL JL -A J- -J. W M JL VIA Lp* 


Similar to myoglobin 


809 


100 


1188 


gi386872 


Homo sapiens 


myoglobin 


806 


99 


1 1 89 


p-i 145 87078 


Danio rerio 

X-^ Ct-XXXV^ Ivllv 


Unci 19c 

V-/ llv JL> X w 


851 


69 


1189 


gil4587076 


Danio rerio 


Unci 19b 


780 


76 


1189 


gi4103940 


Mus musculus 


UNC-119 


701 


60 


1 1 on 

1 1 yu 


cn'9 1 ^ 

glZ 1 J uj'r 1 -> 


XJ.U111U bciJJlCllo 


aHfTTvlo^nprinatp ^vntViptasp iso7vme 

dLl^/ll V IVJiD Uv^ V^llld Lv^ OyilUlVLuJV AC3\J^-iJ' lllu 


2399 


100 


1 1 on 


o-izinA7S 1 
gl-4-v-+ / D 1 


iVXUo lliUSCUlUS 


ciH<=»Ti\/1n<inr , r , iTifitf a «ivntlTPta^P 

U.L1C11 y lUo U-^OlllU. Lt/ O jrllLHO LOO^/ 


2353 


96 


1 1 on 


rj\A 1 S8ZlO 

gi-+ 1 j o-+y 


XlUlllU bctpiCXlo 


5iHpn"\Alricnppi"tifii'P Qvntnptfi <?p 

dLlCJJL y lUiD U-\^\_/lllClLw O yillllwlaO^; 


1861 

x v.* X 


77 


1191 


gil0834696 


Homo sapiens 


PP4189 


744 


98 


1101 
1 iy 1 


rvil ^OQ 1 TO 1 

gn jUo 1 / y 1 


/\raDiciopsis 
thaliana 


At1 O009R0/T1 9M4 1 

xA.LlgWi/ZOV/ 1 1 ■£1V1*t_ 1 


510 


36 


1101 
1 iy 1 


gll J /UZ04 / 


otapnyiococcus 
aureus subsp. 
aureus i^jij 


conservcQ xiypuiciiLiL/ai piutem 


4^8 




1192 


gi2970646 


Mus musculus 


Xin 


3605 


52 


1 1 00 
1 lyz 


gizy /Uo44 


vjraiius gaiius 


Yin 

Ain 


1 777 
i / / / 




1192 


AAO05743 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 


629 


93 


1 iyj 


rri 1 Q^/IIS'nQ 

gliy D^DODy 


ivius museums 


CJ •» i"v»"i lot* tr\ "i7£^cir*1p» qthiup trancnnTt 
ollllllal IU VCS1L/1C dllllllC lla.liojJlJi l 

protein 1 


1 826 

X UZ<v 


87 


1 1 


giouzz / 0 


noino sapiens 


SUIXlldl LU Jra-LvlllL/ xay V r\ x x LJIUIC/JUI, 

^\7i/i<:<3-Prr»t Arrpniinn TsJnmbpr PI 


1579 


99 


1193 


gil2804921 


Homo sapiens 


Similar to membrane protein of 
cholinergic synaptic vesicles 


1556 


100 


1 1 OA 

1 iy4 


A A T3QO</1 1 


xlomo sapiens 


A PRD Hnmon C*T A QP 7 rvrntpiri 

cpnnptipp QFD TF) 7STfV9 
oCl[UCllWC ui-/y xLJ 1> W.z,, 


4269 


99 


1194 


gil4598037 


Homo sapiens 


human CLASP-7 


4269 


99 


1 1 O/l 

1 iy4 


AAr>yy4y4 


riomo sapiens 


/A.rvlD w~ JTU.rila.ll V^Lriuf "J piULClll 

cpnnpnrp T7ior zl 
bCLjUCllL/C Ig 


^153 


72 


1195 


gil 1493417 

C^l X X m-J 1 X / 


Homo saniens 


PRO 1292 


244 


100 


1196 


AAB60467 


Homo sapiens 


INCY- Human cell cycle and 
proliferation protein CCYPR-15, SEQ 
ID NO:15. 


972 


100 


1196 


AAG93315 


Homo sapiens 


NISC- Human protein HP10456. 


972 


100 


1196 


AAM41305 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 6236. 


972 


100 


1197 


AAU78233 


Homo sapiens 


PFIZ Human VDUP-1 protein. 


1631 


99 


1197 


gil 1559490 


Homo sapiens 


VDUP1 


1631 


99 
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c 

score 


identity 


1197 


gill 17464 


T T * - - - 

Homo sapiens 


bram-expressea HrlCrA/c5 nomoiog 
VDUP1 


1 1 


QQ 


1198 


gil4906268 


Homo sapiens 


Pur-beta 


1 Ojo 


1 no 

1 


1198 


gi2460H9 


Mus musculus 


vascular actin single-stranded DNA- 
binding iactor z p44 component, 
purine-rich single-stranded DNA- 
omarng protein ueta, Jruri3 oeta 


1573 


94 


1198 


AAV31720 
aal 


Homo sapiens 


MOUN Nucleotide sequence of the 
PUR-alpha gene. 


990 


71 


1199 


gil7945201 


Drosopnila 
melanogaster 


Rblz40zp 


4o / 




1199 


gil7945l28 


Drosopnila 
melanogaster 


Ti "D AO m O** 

K_b0o93zp 


4DI? 


9R 
Zo 


1199 


gi5668787 


Arabidopsis 
thaliana 


T?0 1 T TO 1 O 

F21H2.12 


o 1 


9S 
Zo 


1200 


AAM88553 


Homo sapiens 


T TT TA K A TT 

nUMA- Human 

immune/ naematopoie tic anxigen oiiv^ 
ID NO:l6l46. 


ozo 




1200 


gi 17945760 


Drosopnila 
melanogaster 


T> T77T'2 A1-rt 

KJti333UZp 






1 1AA 


gllU3y44 / 


oaccnaromyces 
cerevisiae 


T t^V\ 1 
LpDlp 


31 T 


30 


1ZU 1 


glloulUlo / 


nomo sapiens 


in oiiz pro te in 


1 663 

J. UUJ 


100 ' 


1201 


gi20070972 


Homo sapiens 


similar to NSE1 protein 


618 


46 


IzUl 


gilo5Uj /3 / 


Homo sapiens 


iNoiii protein 


uio 


46 


1202 


gil9353084 


Homo sapiens 


Similar to RIKEN cDNA 1700026A16 
gene 


3951 


100 


1202 


gil3561413 


Homo sapiens 


F-BOX domain protein 


2076 


99 


1202 


gi65olo31 


Homo sapiens 


muscle uisease-reiatea protein 


1 UJ*T 




1203 


gizl99oU 


Homo sapiens 


paraneoplastic cereoeiiar aegenerdtion- 
associated antigen 




1 \J\J 


1203 


g\24llll 


Homo sapiens 


autoantigen recognizea oy an ann- 
neuronal cell antibody 


zzt_> 


1 VJv/ 


1203 


gil7Q283o3 


Homo sapiens 


similar to cereoeiiar aegeneration- 
related 2 


ZZt - • 


1 00 

1UU 


1204 


gill 870006 


Rattus 
norvegicus 


nuclear matrix transcription iactor 


/ J / 


o 1 


1204 


gil 87003 13 


Homo sapiens 


nuclear matrix transcription factor4 


720 


80 


1204 


■ A Ann 

gi6729087 


Rattus 
norvegicus 


Cas-associatea zmc linger protein 


71 £ 
/ ID 


QO 
oU 


1205 


A A T"» A /I O A 1 

AAB 94801 


TT „ 

Homo sapiens 


HxiLl- Human protein sequence 
11J JNU. oy3 


40O 


1 OO 


1205 


AAO 12222 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

1NU Z01 14. 


466 


100 


1205 


gil3537206 


Homo sapiens 


Mel 18 and Bmil like ring finger 


466 


100 


1206 


"1 O /I '"7 O ^ C~l 

gil8478557 


T* i i 

Rattus 

iiurvcgiL-uo 


paraneoplastic onconeuronal protein 
MA 1 


1 Q 1 
lol 




1206 


AAB74701 


Homo sapiens 


INCY- Human membrane associated 
protein MEMAP-7. 


178 


48 


1206 


gil4030861 


Homo sapiens 


paraneoplastic neuronal antigen MAI 


178 


48 


1207 


AAY72162 


Homo sapiens 


BAUG/ Human RNA metabolism 
protein (RMEP-2). 


2791 


90 


1207 


AAB56895 


Homo sapiens 


ROSE/ Human prostate cancer antigen 
protein sequence SEQ ID NO: 1473. 


2759 


97 


1207 


AAW74802 


Homo sapiens 


HUMA- Human secreted protein 


2759 


97 



wo 
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SEQ 
ID 


T"W ■ J. TT\ 

Hit ID 


Species 


Description 


c 

score 


identity 








encoded by gene 15 clone no^JcJLZD. 






1208 


gi21262188 


Homo sapiens 


CTCL tumor antigen LI 4-2 


926 


95 


1208 


gi 15292291 


Drosophila 
melanogaster 


T T"\ A C £ OO 

LD45682p 


1/1 


zo 


1208 


gi20087019 


Plasmodium 
falciparum 


maebl 


157 


28 


1209 


gil5991574 


Homo sapiens 


eferin 


97 


29 


1209 


gi21428632 


Drosophila 
melanogaster 


T T1 AH 1 1 

LP071 lop 




Zo 


1209 


gi4176552 


Schizosaccharom 
yces pombe 


protein kinase bublp 


84 


26 


1210 


gil8257326 


Mus musculus 


RIKEN cDNA 2410002022 gene 


2089 


96 


1210 


AAB95495 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 18037. 


1 Q 1 *7 
lo 1 / 


1 C\C\ 
1UU 


1210 


gil5080536 


TT ■ 

Homo sapiens 


Similar to kikkini cujna z4iuuuzvJzz 
gene 


1UZO 




1211 


gil 8257326 


Mus musculus 


T1T7/"C\T „Tv\T A 1/1 1 AAA1A11 ftQ „ a 

KiKhN cDJNA z4iuuuzvJzz gene 


ZU40 


OA 
V4 


1211 


AAB95495 


Homo sapiens 


HELI- Human protein sequence SEQ 

TTV \T/^. 1 oAn 

ID JNU:loU3 /. 


1774 


98 


1211 


gil5080536 


Homo sapiens 


Similar to K1KLJN cDJNA z41UUUzL>zz 
gene 


yoo 


y / 


1212 


AAU15918 


Homo sapiens 


HUMA- Human novel secreted protein, 
Seq ID 871. 


Z /ZJ 


QQ 


1212 


ABB56422 


TT * 

Homo sapiens 


HTT A \T TT 

SHAN- Human cancer suppressor 
protein PP902. 


1 HH A 
1 / /4 


1 nn 
1UU 


1212 


AAU 16370 


Homo sapiens 


HUMA- Human novel secreted protein, 
Seq ID 13z3. 


14UU 


y© 


1213 


gil9353360 


Mus musculus 


KlrLbN cDJNA zU lUUUULrlD gene 


4 / D 




1213 


gi268 


Bos taurus 


cytochrome c oxidase subunit VIb (AA 
1-86) 


A A < 

44j 


on 
yu 


1213 


AA013877 


Homo sapiens 


HYSb- Human polypeptide bLQ 1JJ 
NO 27769. 


A 01 
4ZJ 


O J 


1214 


gil3279335 


TT ' „ ^ 

Homo sapiens 


plecKstrin Homology, oec / ana 
coiled/coil domains 2 (cytohesin-2) 


1 

1 i?Z4 


07 


1214 


gil575766 


Homo sapiens 


cytonesin-z 


1 Q9A 

lyzH- 


Q7 


1214 


glioo5503 


Mus musculus 


cytonesin-z 


1 Q1 1 


Q? 


1215 


gi5138916 


Homo sapiens 


PTD012 


1116 


99 


1215 


gil6l8264l 


Drosophila 
melanogaster 


(jrH073Ulp 


no 1 

/Z 1 


4 / 


1215 


gil9527625 


Drosophila 
melanogaster 


RE72485p 


665 


44 


1216 


AAM49106 


Homo sapiens 


SHAN- Human beta transduction 
tactor 14. 


776 


100 


1216 


AAG66887 


Homo sapiens 


SHAN- Human zinc ringer protein 17. 


230 


37 


1216 


AAG66795 


Homo sapiens 


BIOW- Human ATP-dependent serine 
protease 16. 


lDZ 




121 / 


glZlooU14D 


Homo sapiens 


Jrj-^juuioo proiem 


17S^ 

j. / j j 


100 


1217 


gil5209353 


Caenorhabditis 
elegans 


Y39B6A.1 


186 


28 


1217 


gi8096269 


Nicotiana 
tabacum 


KED 


153 


23 


1218 


ABB89403 


Homo sapiens 


HUMA- Human polypeptide SEQ ID 
NO 1779. 


792 


96 


1218 


AAB56606 


Homo sapiens 


ROSE/ Human prostate cancer antigen 
protein sequence SEQ ID NO: 1 184. 


792 


96 
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TD 


wit rn 




npcrrinfi on 

1/CJtI 1|JLIU11 


score 


X CI l/Cll I 

identitv 


1218 


AAG01959 


Homo sapiens 


GEST Human secreted protein, SEQ 
TD NO- 6040 


567 


99 


1219 


gill 84949 


Homo sapiens 


pliosphotyrosine independent ligand for 

the T rk SH9 domain r>69 

Ullt- J— /V-/IV UllA> U.VJJL11U.I1J. LJ\J+— 


1821 


94 


1910 

1-617 


m 17^1 9969 


jnuiiiu odLJitJiio 


cpnnp^tncnmp 1 

otu LlCo LVJoVJlllC 1 


1821 


94 


1219 


gil2804857 


Homo sapiens 


Similar to sequestosome 1 


1821 


94 


1 990 


o-i 1 0440^10 

gl I \JH*-t\J D D U 


XlOlTlO Sapiens 


ThT T001 1 S r\rr%t/ain 


400S 
two 


0Q 


1220 


gi5917730 


Homo sapiens 


F~box protein Lilina 


2467 


99 


1ZZU 


A AT3Q1 1 CQ 
/VAJtSyj loo 


xiomo sapiens 


rixii^i- riuman proiern sequence oxi^ 
ID NO: 12 140. 


1 o^o 




1ZZ 1 


A A A/fG^111 


riomo sapiens 


riuivi/\- riuman reproaucxive system 
rciaicu. dnugcn odv^ x.L' inw. jyy 1. 




JO 


1221 


gil4039845 


Homo sapiens 


testes development-related NYD-SP 1 8 


142 


29 


1 to 1 

1ZZ 1 


A A A/TG^/IK 


riomo sapiens 


riuiYi/\- jiuman reproauctive system 
related antigen SEQ ID NO: 4083. 


1ZJ 


10 


1222 


gi9802536 


Arabidopsis 
uiaiiana 


F17L2L25 


97 


23 


1ZZZ 


giiozzo4yo 


/Yrao loop sis 

+•1-1 o 1 1 o n c\ 
Llld.lld.llcl 


A+1 rr77/t^n/T7 1 7T 1 1 
/VtlgZ /^OU/r 1 /IjZ 1 zo 


07 


91 

ZJ 


1 999 


criS00964S 


iiUlllU isapiciib 


117 Z, U1UIC111 


04 


99 


1 991 


cnl 61 Q7R97 

gll vl iy / oz / 


i~j"rf"\crvt"\Tn1 cx 
XJx UoOpilila 

melanogaster 




60S 


S9 


1 991 


A AV6R7R1 

iV/i 1 UO / O J 


1.1U111U OdLlldlo 


llNv-/ X ^A.111111U ClKsXSJ. OCU LlCllUC VJ1 d 

human phosphorylation effector PHSP- 
1 S 

X *S m 


SIS 


41 
" i 


1 993 


pi31 1 6914 
jpi-' 1 1 i « 


JLlUlllU odLUCilo 


jiu U111U.111& yjxvjixiixi 


538 


41 


1 994 


cri 1 R0R740R 

i£l lOuO /T^UO 


_lhjiiiu oauiciib 


111V1VJ - UVJA. LI dliOC'l 1U LHJ11 luu HJJL XJJJ^VCl 


4731 


99 


1 994 


cH1 8087406 


TTr^mr\ csinifMiQ 
1.HJ11HJ odjJlGllo 


TTA/ffr-hrvv" tra"nQr*ri"ntir»"n "fantcsv RRY 

1 llvi. VJ L.' VJA. LI CllloOl iU L1LJ11 laLLUl XJJD-fV 


4731 


99 


1224 


gil8418655 


Homo sapiens 


HMG-box containing protein 


4690 


96 


1 995 


A A"R9 1 049 

AADZ 1 UtZ 


nomo Sapiens 


llVKs 1 ntllllall IltlL'lCl^ etUlU.~DlIlU.lll^ 
rvrrvfv^in "NJn ART*-46 

LJlVJLCIllj 1>J IXTV1_>X *TV1. 


9747 


06 


1 995 


AIVR 1 1 1Q1 

iViJi-) 1 1 1 7J 


injiliu odjJiciib 


T-T"V"^1"P— TTunnsin 7v\ fin ct^t Tvmtf^in 

11 1 OD llU.lllu.ll ZJll llllt^Ci Lyl KJ LC/111 

homologue, SEQ ID NO: 1563. 


9149 


99 

yy 


1 995 


oil 89 0478 R 


"IV/Tnc Tnncf"*n luc 

IVlLtO lllLlo^LllLlo 


RTKFN rDNA 981 041 1K1 6 frene 


1 986 


71 

/ 1 


1 996 


oi99 17970 


1-Tnmn cat^if^Tic 
X ItJlllU odJJldlo 


nztO 


1931 


66 


1 996 

JL <Z>Z. \J 


ail 9653463 


x 1U1I1U odjpiC/ll& 


R ahO pffprtnr n40 


1 99S 


66 


1226 


AAM25756 


Homo sapiens 


HYSE- Human protein sequence SEQ 
Tf) NO* 1 971 


1070 


61 


1227 


gi22 17970 


Homo sapiens 


p40 


1597 


81 


1 997 
1ZZ / 


ail 96S1461 

gl IZUJJH-Uj 


nomo odpiciiiS 


XvdDy C11CCLUI pH-U 


1 

1 J oo 


R1 

O 1 


1 997 


A A1VT9S7S6 


nomo sapiens 


ri X oiJ»~ IxUIIlall piULCUl oCqUcllOC OJZ/V^ 

ID NO:l27l. 


1 416 


7S 


1 99£ 

1ZZO 


ai6S7917Q 
giu j / £>o i y 


nomo bapiciib 


UJJ / !7lM lO.Z ^OJJ/I UlllUlll^ Id-CtOl 1 ) 


9Q69 

Z7UZ, 


69 
uz 


1 998 


ai101 5S3R 


noinu bapiciio 


YY1H"*1**SIT nllil l_CnAPlflPltl7 T'xlirtc-i^li o td CP* 
llUOlCa.1 LlUcll-LjpCL/lllL/lty [JllUbpiltltcloC 


Z7JJ 


69 


1 998 


ail S 909 601 


-L^l UoOpillld. 

melano master 


^D1 0S41n 


9006 


45 


1229 


ABB50213 


Homo sapiens 


INCY- Human transcription factor 
TRFX-64. 


2087 


100 


1229 


gi 10334466 


Homo sapiens 


dJ63P18.1 (novel zinc finger protein) 


2063 


100 


1229 


AAM78755 


Homo sapiens 


HYSE- Human protein SEQ ID NO 
1417. 


937 


43 


1230 


gil9908264 


Homo sapiens 


N-RAP-like protein 


2766 


94 


1230 


gi2351568 


Mus musculus 


N-RAP 


1702 


56 


1230 


AAU21865 


Homo sapiens 


HUMA- Human cardiovascular system 


1136 


98 
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SEQ 
m 


Hit ID 


Species 


1 Description 


S 


Percent 
identitv 








antigen polypeptide SEQ ID No 639. 






IZj I 




nomo Sapiens 


DVJJL/A- AlUlllan A A Avis SlAUOLUiai 

domain nrotpin SFO TD NO* 2 

lAUXliaiii pAUlGill Oijy AA> 1 > V-/ . ^ . 


2757 


99 


1 711 


A AMQ1700 

XT.xT.lVA ZJ J 1 \j\J 


AALJ111U bapitllb 


HFT T- Human nolvnentide SEO ID 

A A J_jA-vA AAlAlliail LJ\Jly \J^\J LAVAOj kJAjV^ 1A/ 

NO: 3623. 


2257 


99 


1 711 


A/l V V Jv/v 


AAU11AU oaUlvHi} 


T TSSH Human OGT nrotein 


129 


24 


1 717 


A AM4041 8 


Hnmn QaTMPnQ 


HYSE- Human nolvnentide SEO ID 

A A A kJA—r 11U111CU1 L/Ui V UvUUUv IJLy 

NO 3563. 


6480 


100 




A AM47707 


Mnmn QanipnQ 

AAUH1LJ OdJL/lC'llO 


HYSE- Human nolvnentide SEO ID 

AAA O AAlAliAClll L/Uly UwU LAVAw Uijy 1A^ 

NO 7133 


6296 
\j i— \j 


100 


1232 


AAW01825 


Homo sapiens 


CLLT Human metalloproteinase. 


107 


34 


1 711 


cn'1 765571Q 


T-Tiimr* cairipnc 

AAvJUHJ ocipitllo 


i-iA477F5 7 1 fnovel nrotein i^nfhrm 1^ 

LfjrxrT / 1—i X ^AAlJVt/A JLJAVJL^/lAlj loUlUlllI A y 


1179 

A A / 


100 


1233 


gil75 12323 


Mus musculus 


Similar to RIKEN cDNA 2610016C23 

HCllC 


1135 


66 


1233 


gil2655240 


Homo sapiens 


bA472E5.2.2 (novel protein, isoform 2) 


994 


98 


1 7^4 


xr.xT.vjr / 


T-T/TT"l"lf"\ OQtMPTIO 

irLoixiL) Sapiens 


AjJLvJ VV XAUIIlall HiJUlsLI'IIIC O / pAULClll 

16. 


751 


1 00 

A \J\J 


1 714 


gl 1 ZUW J J7U*+ 


xAOino Sapiens 


AL/UJU 


751 


1 00 

A UU 


1234 


AAG78905 


Homo sapiens 


BIO W- Human ribosome S7 protein 1 6 
peptide fragment. 


77 


100 




giiyuoooio 


rincepnaiiiozoon 
cunicun 


similar ny to xv/viJ^f protein 


711 
Z 1 1 


It 




gizuu /JUUO 


ivius muscuius 


similar 10 lopoisomerase yurvr^j n 

V\ i fi n i n fr irr/^tf^iTi 
UlllLllllg pAUlCAll 


1 04 


77 


1 715 


A A V01 1 81 

Ari I Uj 1 OJ 


nnjiiio oapiciib 


T'^IT TT? / T^OT^rxi c f\YY)f*va q r 1 T hinHino 
1 OUJV 1 UjJUloUlllC/lasC AA UlllClAllg 

protein 2. 


197 


76 


1 716 

1 Z. J KJ 


A A061817 


AAvJlllvj oaUlullo 


7\TTh _ Amino apiH Qpniifnpp of linmari 

\JAil i A; XT.111A11VJ avvlLl obUUbilL'ti Ul llUlliail 

cardiomvonathv associated nrotein 
(CAP). 


1869 


100 

1 \J\J 


1236 


AAG93278 


Homo saniens 


NISC- Human nrotein HP03 1 15 

± ^ X V—/ X X V4XXJ.UX X \J X XX V<f i-/ X X • 


1869 


100 

-A \J \J 


1236 


gi9945010 


Mus musculus 


RING-finger protein MURF 


1116 


95 


1 717 


ail 1 Q08000 

gl A l^UOvww 


ATAU111U odUlCllo 


ROT rnrpnfpQQnf Qhoft i cnfnriTi 

Uv/Aj v LUlvUlvOOxJl OllVJA L loUlUllll 


5272 


100 


1 717 


1 A .7 V/ / yyij 


TTomn cani f»n q 

AAvXlllLJ OdJJlC/llo 


AJ V^A^ VJ LfUlvUiVQOvl 


5246 


99 


1 717 


A AM961 54 

XT-XTAYAJ' U J. 


AAvJlilU oapit/llo 


TTTTA/TA- Human rpnTodnptivp c vstpm 

A A XVAXT. A AlAllltAll 1UU1UUUV/UVU ojrotl/AAA 

related antigen SEQ ID NO: 4812. 


290 


96 


1 218 


ail 7655178 


Unmri QaniPTlQ 
AAvxAllvJ oauibiio 


ov n fH f* a rK owl a tp paTriftT* 


1552 


100 


1238 


gi 1265 5 649 


Rattus 

1AUA VCglV/lAo 


motochondrial oxodicarboxylate carrier 


1297 


82 


1 218 


ai 1 8447084 


Titt* c r\in n 1 1 a 

A^IUoUpilAia 

melanogaster 


AT21724n 

XT. A X-—? Z»Z.*T^ JU 


581 

-J \J A 


41 
~ i 


1 710 


ail 576471 


ivius muscuius 


"K' AP1"R 

AVxva J; A3 


1 1 5 


71 


1239 


gi 15264 19 


Mus musculus 


KAP3A 


115 


21 


1 O^Q 
LZjy 


A/\±5U ly 1 d 

• 


xiomo sapiens 


u i vJxv xt. numan neural piaKopuiim 
related armidillo protein. 


1 07 
1 uZ 




1240 


AAY01070 


Homo sapiens 


SUME Human l(3)mbt protein 
sequence. 


1461 


48 


1 740 


o"i 181 1111 

gl-J O 1 A 1 1 1 


AAU1I1U bapiciio 


IT! imVit "nrotpin Tirimoloo" 

JlllUl. JJlVJl.t/111 llVJlllWHJg 


1461 


48 


1240 


gil 1323323 


Homo sapiens 


dJ138B7.3.1 (continued from 
dJ862K6. 1 in Em: AL03 1681) 


1447 


51 


1241 


AAG00343 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 4424. 


277 


94 


1241 


AAM89138 


Homo sapiens 


HUM A- Human 

immune/haematopoietic antigen SEQ 
ID NO: 16731. 


260 


97 


1241 


gi 21296312 


Anopheles 


ebiP3199 


441 


28 



wo 



03/029271 



PCT/US02/30474 



263 
Table 2B 



TTi 


1111 11/ 


opecies 


ucsu lpuon 




X tJI C till I 

iHpntifv 




|gb|EAA084 

J / . 1 1 


gambiae str. 

PPQT 

JrHo 1 








1242 


gi20135645 


Homo sapiens 


SVAP1 protein 


2376 


99 


lz4z 


glo / /Woo 


Arabidopsis 
thaliana 


T7i/:Ai/i 1 a 
r 10A14.1U 


y3 


Zo 


lz4z 


• C A A 0 1 O 

glDUUoio 


baccnaromyces 
cerevisiae 


YnrUoOcp 


AO 


Zj 




gil433U44o 


Homo sapiens 


zinc finger protein RINZF 


4ZoZ 


yj 


1243 


gi4557143 


Rattus 
norvegicus 


zinc finger protein RIN ZF 


2792 


78 


1243 


AAB94280 


Homo sapiens 


HELI- Human protein sequence SEQ 

m XTO . 1 A 1 1 1 
IJJ JNU. 14/11. 


1343 


100 


1244 


gil7389709 


Homo sapiens 


Similar to RIKEN cDNA 1 1 10001 A07 
gene 


608 


100 


1244 


AAG03897 


Homo sapiens 


GEST Human secreted protein, SEQ 
11/ iNvj. /y /O. 


551 


91 


1244 


AAB93244 


Homo sapiens 


HELI- Human protein sequence SEQ 

liJ XNL/. 1ZZD J. 


77 


29 


1Z4D 


rril 1 C\A C\A 1 ^ 

glZ 1U4U41D 


nomo sapiens 


1 ruts pseuaouriaine syninase-UKe 
protein 1 


1 HAH 
1 /4 / 


1 nn 

1UU 


1Z4j 


AA x 3 JV04 


xlomo sapiens 


ojao 1 xixtenaea numan secreted 
protein sequence, SEQ ID NO. 213. 


o3y 


oo 


1Z4D 


giyyj iuu4 


i^seuaomonas 
aeruginosa 


uvln a pseuaouriome syntnase 


D ID 


3y 


1 


gii3 i /yu 


nomo sapiens 


similar 10 iNJisJirN cuin/\ zyuuuuyiu/ 
gene 


/Jl 


ou 


1 1A6. 


glZUJUO^f JU 


ivius mus cuius 


iviisjj/iN cl/in j\ zyuuuuyiu / gene 


HA& 


jy 


1246 


AAG74211 


Homo sapiens 


HUMA- Human colon cancer antigen 
protem oJcv id iNij.4y id. 


597 


96 


1247 


AAM80283 


Homo sapiens 


HYSE- Human protein SEQ ID NO 

3yzy. 


1202 


58 


1247 


gi6409345 


Homo sapiens 


zinc finger protein ZNF180 


1196 


60 


1247 


gi8050899 


Homo sapiens 


ZNF180 


1196 


60 


lz4o 


ABBy03y4 


Homo sapiens 


HUMA- Human polypeptide bbQ ID 

rt\J Z / /U. 


14o3 


AA 

99 




gllDZ 140OD 


nomo sapiens 


oimiiar to KJ.JsJiJN cl/jna Z3 iuuoiuu4 
gene 


QOA 

oZ4 


1 AA 
1UU 




A AD fcACil 
AAdjO'+Uj 


nomo sapiens 


KiJoxi/ numan prostate cancer antigen 
protem sequence oEy xlj imu.^oi. 


on 

£513 


y<5 


1249 


AAB60476 


Homo sapiens 


INCY- Human cell cycle and 
proiireration protem l. i jtiv-zh-, oii^ 
ID NO:24. 


1592 


100 


1249 


gil4603190 


Homo sapiens 


MAGEF1 protein 


1592 


100 


1 O/IQ 

1Z4V 


gn iuyoi3 / 


Homo sapiens 


A/1" A OT^T? 1 

MAlxbr 1 


1 COO 

IDyz 


1 AA 

1UU 


1250 


gil4043262 


Homo sapiens 


Similar to RIKEN cDNA 1500026B10 
gene 


1076 


100 


1 1 ^a 
IZjU 


glZUU /3ZDo 


Mus museums 


T> TV"CT"KT y>"P»XTA 1 1 A A A A 1 "P* 1 G rrr.no 

KJ-JSJiiN cujna iiuuuuiuiy gene 


/JO 


/3 


1250 


gil7646146 


Homo sapiens 


B lymphocyte activation-related 
protein 


462 


47 


1251 


AAE06142 


Homo sapiens 


HUMA- Human gene 7 encoded 
secreted protein fragment, SEQ ID 
NO:204. 


566 


98 


1251 


AAE06047 


Homo sapiens 


HUMA- Human gene 7 encoded 
secreted protein HBJFJ26, SEQ ID 
NO:109. 


566 


98 
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ID 


rill 1JJ 


opecies 




s 

score 


Percent 
identity 


1ZD 1 


A A VQ71 


JnLOIXlO sapienb 


WTTTVyfA Wnman ^prretefl nrotein 
cpniiprrp SFO TD NO204 


566 

\J \J 


98 


1 9S9 


ai9477^1^ 


XJ.wlJ.lVJ oa.jJlCJ.lo 


F25965 3 


2718 


87 


1252 


AAE13842 


Homo sapiens 


CORI- Human lung tumour-specific 
nrotein 20129 


540 


33 


19^9 


oil S 1 4S7Q7 




VjfiQif "nrAliTip-Tir.il nrntein 


330 


31 


1253 


gil5139362 


Mus musculus 


aczonin 


6067 


96 


1 9^1 




ivallUS 

norvegicus 


mi llti Hnma in T^T"f 3 Q"\m a All C PVTATTlJltTIV 

lllUlLlLUJlllalll jJlC&yila.jjLl^' vjr Iv/illvllilA 

protein Piccolo 


6043 


95 


lZJJ 




OallUb gallUo 


aUZiVJlliii 


5501 


87 


1254 


gil5139362 


Mus musculus 


aczonin 


6006 


95 


1254 


gi7493836 


Rattus 

* 

norvegicus 


multidomain presynaptic cytomatrix 
protein riccoio 


5982 


95 


1 O C A 

1254 


gl6433o44 


Ljaiius gallus 


aczomn 


^440 


Ow 


1 o c c 

1255 


AAY3946 / 


Homo sapiens 


A TD /^H Lj TJararArtQ-n Alio finAtinpl^Qf 

AjDvjrrs- Heterogenous nuonuoicdr 
protein K. 


9^07 


04 ! 


1 o c c 

1255 


• ,1 A AO OA 


TD +4-, , r-, 

Kattus 
norvegicus 


ox>-stretcn Dinamg protein ^^or>jr ) 


9^07 


04 


1255 


giZ414 /o 


riomo sapiens 


neterogeneous nuciear 
ribonucleoprotein complex K; hnRNP 
i\^ 


9^07 


04 


1256 


AAW47589 


Homo sapiens 


BOEF T-cell receptor beta-chain. 


537 


91 


IZjO 


gllDDZ4yD 


Homo sapiens 


v segment iraiisidiiuxi piuuut/L 


^4 


00 


1 oc^ 


glioZloO / 


riomo sapiens 


i ceil receptor oeid. L/iid.111 vaiiauic 
region 




00 


LZd / 


A A "N/fQ/l 1 7'7 


riomo sapiens 


nuivi/\- nuiiictii icpiuu-UL/iivc oyotcin 
related antigen SEQ ID NO: 2835. 


^70 


100 

J. WW 


IZj / 


giouyozoy 


iNiconana 

4-<^ X-» q nil rn 


JtVJJ/lJ 


j ,7 


22 


1 9^7 
1Z0 / 


o-i7^zLQ9 1 0 


-tsaDcsid. 

UlgCIIUlllcl 


ZrUU l\l_x U. dllLlgCll jJZ^W 


298 


25 


1258 


AAG93327 


Homo sapiens 


NISC- Human protein HP 10502. 


1438 


100 


1258 


AAB43628 


Homo sapiens 


HUMA- Human cancer associated 

-r\i*r\+<='ir» cpniipnpp QTh T"T\ ~NTf~V 1 07^ 
piULCUl bCLjU.CllL'C OJQV^ LIS INVy. 1 \J / J . 


1101 


99 


lZDo 


gizu^ /yy4.5 


ivius muscuius 


Similar ta RTTf FTsJ rTilVJA 40^0S^^A/T1 R 

oirrjllal LO IvllViJ/lN UJ_^1N/A. t r7JuJ JJlVll O 

gene 


612 


59 


izoy 


glioO /OjUO 


riomo sapiens 


TTT Tfim ^0 rvrATPin 
rLJUUlJu piuiciii 


8766 


99 

ZS -s 


1259 


gi3435244 


Homo sapiens 


centriole associated protein CEP1 10 


4915 


99 


I25y 


gll oU IKJODD 


rlomo sapiens 


D/viODr^^H-.z ^centrosomai protein i ) 


4RR7 


08 


1260 


ABB06784 


Homo sapiens 


SHAN- Human cancer cell growth 
inniDiung protein r^r^H- / oz oiiv^ ir^ 
NO:20. 


205 


39 


1 O /£ A 

lZoU 


gllUo34 / 14 


Homo sapiens 


Jr Jr4 / OZ 


90^ 


^0 


1 O/CA 

lZoU 


glJo /oUj 1 


Caenornao cutis 
elegans 


nz ii^uj.z 


1 W-7 


^1 

J> 1 


1 1^ 
1Z01 


AddUOIJo 


Homo sapiens 


\_s\jivi.r'- nuiiiaix i > o proiem bctiu.ciiuc 
SEQ ID NO:225. 


1 626 


93 


1261 


AAM25414 


Homo sapiens 


HYSE- Human protein sequence SEQ 
ID NO:929. 


874 


97 


1261 


AAY07040 


Homo sapiens 


LUDW- Breast cancer associated 
antigen precursor sequence. 


856 


39 


1262 


gil4550457 


Homo sapiens 


pellino (Drosophila) homolog 2 


2290 


100 


1262 


gilO-242353 


Homo sapiens 


pellino 2 


2290 


100 


1262 


gil0242357 


Mus musculus 


pellino 2 


2110 


92 


1263 


gi 18478648 


Homo sapiens 


EKN1 


1790 


98 



WO 03/029271 



PCT/US02/30474 



265 
Table 2B 



OTP /"V 

SEQ 
ID 


Hit ID 


Species 


Description 


store 


jrerceni 

1U L-Il Lltj' 


1263 


gi20071434 


Mus musculus 


Similar to EKN1 


1389 


75 


1263 


gliy527o3J 


Drosophila 
melanogaster 


AT1 AC1 

Al iOM Jp 




Z J 


1264 


gi632549 


retromyzon 
marmus 


XT"C 1 on 


ooc\ 

ZZU 


0/1 
ZH 


1Z04 


glZU4Z)Z 1 Ol 


Lams tamiiiaris 


retinitis pigmentosa <jr lipase regulator 


oon 

ZZU 


Zo 


1264 


gi9837379 


Homo sapiens 


retinitis pigmentosa GTPase regulator 


210 


28 


1265 


gil45 17637 


Homo sapiens 


RGPR-pll7 


87 


30 


1265 


gil2313999 


Homo sapiens 


JTOOCT *7 O 1 / _ „ ^ „ J J M -„ r il l ^».*..^..» 
aJbo5L7.3.1 (opioid growth tactor 

receptor (7-60 protein), isoform 1) 


O 1 

o 1 


1 1 


1265 


AAY92809 


Homo sapiens 


PENN- Human opioid growth factor 
receptor spliced, version o. 


/o 


o c 
Zo 


1266 


gil8204361 


Homo sapiens 


PNAS-127 protein 


380 


100 


1266 


gil2751 100 


Homo sapiens 


TVKT AO 1 n 

rNAb-lz/ 




1 fiA 

1UU 


1266 


gi|142 10498 
|ref|NP 115 
879.1| 


Homo sapiens 


PNAS-127 protein 


380 


100 


1267 


ABB50177 


Homo sapiens 


INCY- Human transcription tactor 
TRFX-28. 


1 C7/T 

lo/o 


100 


1267 


gillol 1473 


Mus musculus 


Deltex3 


1 O C O 

1858 


98 


1267 


AAU 16308 


Homo sapiens 


TTT TTV 4" A TT _1 . j 1 ^. _ ' 

HUMA- Human novel secreted protein, 
SeqID 1261. 


1 "5 no 

1302 


96 


1268 


AAB95168 


TT 

Homo sapiens 


TTpT T TT j ' „ — OT7/™\ 

HELI- Human protein sequence SEQ 
ID NO: 17221. 


2667 


99 


1268 


gil3278316 


Mus musculus 


Similar to RIKEN cDNA 2610507L03 
gene 


2174 


70 


1268 


AAG75001 


Homo sapiens 


HUMA- Human colon cancer antigen 
protein SEQ ID NO: 5 765. 


776 


95 


1269 


gi60 16842 


Mus musculus 


nuclear protein ZAP 


3147 


61 


1269 


AAY72168 


Homo sapiens 


T*l A T TO / T T T» *K T A , f "1 « 

BAUG/ Human RNA metabolism 
protein (RMEP-8). 


2798 


100 


1269 


AAM75582 


Homo sapiens 


X *"/\T 1 — ' TT 1 

MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 35boo. 


**T O O 

788 


100 


1270 


AAB93344 


Homo sapiens 


HELI- Human protein sequence SEQ 

tt\ \TA. 1 1 A cr\ 

ID NO: 12459. 


92 


95 


1270 


AAO 13507 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 27399. 


92 


95 


1270 


AAO13502 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 27394. 


92 


95 


1271 


AAG81396 


Homo sapiens 


ZYMO Human AFP protein sequence 
SEQ ID NO:310. 


629 


100 


1271 


AAG00714 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 4795. 


527 


99 


1271 


gi 18447274 


Drosophila 
melanogaster 


TTT r\ ~\ ^ C A 

HL01254p 


287 


45 


1 010 
l Z 11 


gl 1 j/7ZUy / 


urosopnua 
melanogaster 


T V\19.00f\r\ 
ULooZZOp 




77 
Z / 


1272 


gi24 14626 


Schizosaccharom 
yces pombe 


ubiquitin regulatory domain (UBX) 
protein 


404 


31 


1272 


gi7527718 


Arabidopsis 
thaliana i 


T5E21.7 


336 


37 


1273 


gi4263747 


Homo sapiens 


general transcription factor 21 


730 


79 


1273 


gi24 15382 


Homo sapiens 


TFII-I protein 


730 


79 


1273 


gi2440078 


Homo sapiens 


SPIN protein 


730 


79 
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SEQ 

TT* 
IJJ 


Hit ID 


Species 


Description 


S 


Percent 

lUClilllY 


1274 


AAB94010 


Homo sapiens 


HELI- Human protein sequence SEQ 

ID INL/. I^Ijv. 


1921 


91 


1274 


AAM79426 


Homo sapiens 


HYSE- Human protein SEQ ID NO 

3U /Z. 


475 


31 


lz /4 


AAiVl / o44Z 


xiomo sapiens 


xi x oil- riuman proiein oxlv^ ijj inw 
1104. 






12 /D 


AAA /f m O 1 /I 


Homo sapiens 


rltii-rl- riuman polypeptide, o-ca^j ru 
>sr\* 9^1 £ 


Ijj j 


QQ 

yy 


1275 


gil5292283 

• 


Drosophila 
meianogaster 


LD45447p 


172 


30 


1275 


gi607003 


Podospora 
ansenna 


beta transducin-like protein 


87 


23 


IZ /O 


glOUZ4 jo 


id os taurus 


pnospnoprotem 




01 
y i 


1276 


AAM50706 


Homo sapiens 


DIAD- Human breast specific gene- 
encoded proiem oqiiiaiiiUH-z. 


850 


96 


1276 


gi7243755 


Homo sapiens 


neuronal phosphoprotein DARPP-32 


508 


100 


i 077 

IZ / / 


A A ~\AATX 1 Q 


riomo sapiens 


DUJJn- riuman zmc linger sirucuurdi 
domain 52. 




1 no 


1 077 
IZ / / 


gn jyzyu4u 


xiomo sapiens 


o miliar 10 ivirvij/iN cj-/in/\ jyDV^VAr lj 
gene 


z D*\y 


1 on 


IZ / / 


gn / izoj 


xiomo sapiens 


zinc linger protein zy / 


jy*t 




1278 


AAB95788 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 18745. 


2742 


99 


12 /& 




Mus musculus 


zinc linger protein splice variant riZ/i- 
B 


ZJUo 


50 


1278 


gi5579307 


Mus musculus 


zinc linger protein splice variant FIZ1- 
A 


2308 


86 


1279 


gil016116 


Cyanophora 
paradoxa 


ribosomal protein LI 


76 


28 


1279 


gi|20834835 
|rei]XP 136 
513.1| 


Mus musculus 


similar to dJl 1 1 1 A8.2 (novel protein 
with fibronectin type III domain) 


79 


29 


1279 


gi|l 1467315 
|reflNP_043 

1 *7T 1 1 
1 11. I \ 


Cyanophora 
paradoxa 


ribosomal protem Ll 


/O 


ZO 


1280 


AAB94144 


Homo sapiens 


HELI- Human protein sequence SEQ 

T"T\ \Tf\ .1/1/11 O 

1U INw. 144 1 o. 


811 


99 


1280 


AAM42664 


Homo sapiens 


HUMA- Human kidney related 
polypeptide oiii^ id jjj. 


136 


48 


1280 


AAM79272 


Homo sapiens 


HYSE- Human protem SEQ ID NO 

lyDH. 


136 


48 


1281 


AAG74781 


Homo sapiens 


HUMA- Human colon cancer antigen 
protem bJbv^ ijj jn(J:jD4j. 


533 


99 


1281 


AAB53801 


Homo sapiens 


HUMA- Human colon cancer antigen 
protem sequence oxsv^ id inu.io41. 


307 


95 


1 OCT 

lzoz 




rlomo sapiens 


xlxiLri- riuman protem sequence oxiv^ 
ID NO T 194S 


ZOl O 


QQ 

yy 


1282 


AAM93850 


Homo sapiens 


HELI- Human polypeptide, SEQ ID 
NO: 3935. 


1716 


99 


1282 


AAU 16242 


Homo sapiens 


HUMA- Human novel secreted protein, 
Seq ID 1195. 


1677 


100 


1283 


AAB94927 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 16392. 


806 


100 


1283 


gil7979897 


Homo sapiens 


BAALC 1-5-6-7-8 


74 


31 


1283 


gil7979895 


Homo sapiens 


BAALC 1-5-6-8 


74 


31 
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ID 


Wit TD 

XXII XAJ 




Df»scrintion 


s 

score 


Percent 
identity 


1 9R4 


ml 7S9638 

Jill f^s£*\J-J*J 


1XU111VJ OUUXvlXO 


RT14 

XV X X ~ 


630 


100 


1284 


gil703640 


Homo sapiens 


GCN5-like 1; GCN5L1 


630 


100 


1 9R4 


cri?7fiQ^R^ 

i^IZ. / U7JOJ 


A/Tnc mncpiilnc 

lYXUb lllllOlvlllUo 


GPN5T 1 nrotein 


603 


96 


1 9R^ 


A A V77471 


XXVJ11117 odLJIC'lia 


SOT-TF Human deubiauitinatint? rjrotein 

uv/iiJj i xixixiaii u^uun^uiinittiiii^ ijx. v/ ix x 

Dubl 1 SEO ID NO:34. 


2455 


87 


1285 


AAG64049 


Homo sapiens 


KAGA- Human deubiquitinising 
enzvme 


2448 


88 


1285 


AAW30711 


Homo sapiens 


DAND Human ubiquitin-specific thiol 
nrotease DUB D38378 


2262 


87 


1286 


gil3562000 


Latrodectus 

o^potn ptri pi 1 q 


major ampullate spidroin 2 


132 


46 


1286 


gi545069 


Mus sp. 


Brn-3.2 


128 


42 


1 9R6 


oi9fi94Q 


UijiCd aauvd 




125 


30 


1287 


ABB07304 


Homo sapiens 


SUGE- Human protein phosphatase 
^(TPftSft 

OvJi \J. 


2415 


100 


1287 


AAB85358 


Homo sapiens 


INCY- Human phosphatase (PP) (clone 
TD 1793447rDi i 


2161 


98 


1 9G7 
1Z5 / 


a A"\/T^oni n 


T-TnmA oof\"i<^Tio 
XXUlIlU odpiCIlo 


FTV^F- Human nnlvnetitiHe SFO TD 
xx x oIj" x x uiiidii ijui^y jkyt^^7 tiuL/ oxj\^ xi7 

NO 2155. 


929 


45 


1 ICQ 
1 Zoo 


/VJl>JdU / jW4 


XxUIIlU odpiCIlb 


Tf~iT^_ Mnman "nrotpin WhAcj-nhata <5P 

uUvJIj XXUllldll jJlULGlll JJll\7o JJlldLaoC; 

SGP050. 


1333 

X *J *J ~J 


99 


1 9RQ 
1ZOO 


A ARS^^ 
/\.tyx3 Ojjjo 


xxuiiiu octpiciio 


TfsJPY. Human nhrmiVhata^p fPPi felone 

Xl^Vw^ X XXlXillC4.ll LJllvJO LJX1CI LClJKj ^X X J y^XVJlA^/ 

TD 1723447CD1 1 


1079 


97 


1288 


AAM39010 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2155 


704 


52 


1289 


gil0304393 


Bombyx mori 


ornithine decarboxylase antizyme 


75 


51 


1 9RQ 


cri 1 74RR61 1 


T'} Iri fi 1 ct 1 

X LllS.li.Ll l£ Ll 

rubripes 


T^rain anlcvrin 9 


73 


48 


1 9RQ 


CriRQRO^R 


T'ciXrTfii cm 
1 cUVllUgU. 

rubripes 


FRANK? nrotein 


73 

/ w/ 


48 


1 900 


crM 9751 107 


XXU111U odJJXC.Uo 


FVG1 

Xj V VJ 1 


1106 


100 


1290 


AAM95531 


Homo sapiens 


HUMA- Human reproductive system 
related antiffen SEO ID NO* 4189 

lE/icxLi/ii ciiiLig ( v»'ii ux~i\£ ix/ x y v_/ • ~ i i 


219 


100 


1290 


gi2 1064 167 


Drosophila 

T"n£*1cjT1f"»<T£l cfv^T 

UlClalUJ^do ICI 


AT27353p 


212 


29 


1291 


gi20988071 


Mus mus cuius 


Similar to RIKEN cDNA 260001 1E07 
gene 


919 


80 


1291 


gil3562004 


Nephila 

IIldUagdoL/dllCxiolo 


major ampullate spidroin 2-like protein 


144 


26 


1 901 


glZ0UJ70 / 


jZiCjumc 

iicipco Vil Llo 1 


94 


136 


21 

X< X 


1 909 


A A0751 


XXOIIIU odpiCllo 


T-TT TA/T A - TTnman rnlnn panppr anti opn 

ilUiVlri XXLllllu.ll L/U1U11 V^Ullv^l ClllLl^V^ll 

protein SEQ ID NO:5924. 


361 


73 


1 909 


A AMQSORfi 


xxUIIiU bapiCllb 


T-TT TAyf A - TTiiman rpr>Tr»Hnpti VP QV<itpm 

XXl_JlVX^Y X XLillldll 1 CJJl VjVXtiLvtl V C o^ott/lil 

related antigen SEQ ID NO: 3744. 


303 


96 


1 909 
1 Z7Z 


ryil ^5091 75 


T piohmfanin 
XjClolillldllid 

major 




270 


24 


1293 


gil7932848 


Homo sapiens 


Spir-2 protein 


1060 


100 


1293 


gi20072945 


Mus musculus 


Similar to Spir-2 protein 


932 


85 


1293 


gi2 1449822 


Mus musculus 


Spir-2 protein 


932 


85 


1294 


AAU20195 


Homo sapiens 


HUMA- Human novel endocrine 
antigen, SEQ ID No 252. 


587 


91 


1294 


gi20151337 


Drosophila 
melanogaster 


GH0483 lp 


481 


46 


1294 


gil3548492 


Caenorhabditis 


ZC4 10.7b 


349 


43 
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elegans 








1295 


gil81097 


Homo sapiens 


gamma- A-crystallin 


968 


99 


1295 


gi387l35 


Mus rnusculus 


gamma- A-crystallm 


O A C 

845 




1295 


gi203627 


Rattus 
norvegicus 


gamma- A-crystallin 


837 


83 


1296 


AAG02474 


T T * 

Homo sapiens 


GEST Human secreted protein, SEQ 
ID JNU: 0555. 


zoo 


A/C 


1296 


gi7544050 


Streptomyces 
coelicolor A3(z) 


beta-N-acetylhexosaminidase 


69 


30 


1296 


gil05 18509 


Streptomyces 
piicatus 


B-N-acetylhexosaminidase 


69 


33 


1297 


AAB92496 


Homo sapiens 


HELI- Human protein sequence SEQ 

TF% fvTO. 1 ACQQ 
LL) JNU. lUDi/O. 


966 


38 


1297 


gi6581093 


Mus rnusculus 


transposase-like protein 


910 


36 


1297 


gi6581097 


Homo sapiens 


transposase-like protein 


o5 / 




1298 


gil2060820 


Homo sapiens 


serologically defined breast cancer 
antigen IN Y-J3K- 1 5 


2775 


100 


1298 


AAB95624 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID JNU:lo344. 


963 


35 


1298 


AAY07001 


Homo sapiens 


LUDW- Breast cancer associated 
antigen precursor sequence. 


706 


93 


1299 


AAU74387 


Homo sapiens 


CORI- Breast tumour-specific protein 
B305D rusion construct. 


595 


49 


1299 


AAU74379 


Homo sapiens 


CORI- Breast tumour-specific protein 

T"» 11 A ~ 1 1 X*_ — ni i /— i C\ ~\ il 

BllAgl isoiorm Bl 1C-9,16. 


595 


49 


1299 


A AT T~l /tTIO 

AAU74378 


TT * 

Homo sapiens 


CORI- Breast tumour-speciiic protein 

"D 1 1 A 1 J ^ XV. D 1 1 A 

BllAgl isoiorm B 1 lC-y. 


CAC 

595 


A A 

49 


1300 


AAG01406 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 5487. 


421 


100 


1300 


gi2 1429922 


Drosopnila 
melanogaster 


GH12382p 


1 A*7 

107 


26 


1 O A A 

1300 


gi509 1552 


Arabidopsis 
thaliana 


1 10024.21 


1 A 1 
1U1 




1 "3 A 1 

1.301 


gil J 1 1 14oU 


riomo sapiens 


JNesnBr 


ZzOo 


on 


1301 


gi2007 1 840 


Mus rnusculus 


Similar to DKFZP586L2024 protein 


1851 


81 


1301 


AAE01791 


Homo sapiens 


T TT T A yT A TT o 1 ^_ J 

HUMA- Human gene 22 encoded 
secreted protein HOHDF66, SEQ ID 

.110 

JNO:l 12. 


1 1 T1 

1 172 


0 1 
ol 


1302 


AAM25877 


Homo sapiens 


HYSE- Human protein sequence SEQ 
ID NO:1392. 


201 


44 


1302 


AAE 10329 


Homo sapiens 


INCY- Human transporter and ion 
cnannel-o ( I R1CH-6) protem. 


201 


44 


1302 


AAZ11910_ 
aal 


Homo sapiens 


AXYS- Human potassium channel 
K+Hnov28 cDNA (5 splice variant 4). 


183 


34 


1303 


AAB28049 


Homo sapiens 


HUMA- Human secreted protein SEQ 
ID NO: 97. 


204 


100 


13U5 


glO / 1 J l^fo 


ivius museums 


selectively expressed m emoryonic 
epithelia protein- 1 


/z 


*z 1 
j 1 


1303 


AAO 10249 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2414L 


68 


31 


1304 


AAV06060 
aal 


Homo sapiens 


UMIS Human imidazoline receptor 
subtype 1 cDNA. 


91 


32 


1304 


AAW43397 


Homo sapiens 


UMIS Human imidazoline receptor 
subtype 1 degradation product 12. 


91 


32 


1304 


AAW43396 


Homo sapiens 


UMIS Human imidazoline receptor 


91 


32 
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suDtype 1 . 






1305 


gil9264036 


Mus musculus 


membrane-associated tyrosine-and 
tnreonine-speciiic cacz-inniDitory 
kinase 


94 


26 


1 mz 


AAWjOOiy 


no mo sapiens 


yjrs 1 A.- Human ceil gro win regulatory 
protein. 


QA 


ZO 


1 31D 


glZ4oUUZ.o 


Homo sapiens 


memorane-associatea Kinase 


OA 


Zo 


1 T A/C 

liUo 


gll 14 /ol 1 


oaccnaromyces 
cerevisiae 


l^plip 


1 j j 


zo 


1306 


ABB04721 


Homo sapiens 


SHAN- Human PP2464 protein SEQ 
1JJ JN(J:1 /. 


101 


22 


1306 


gi6630459 


Arabidopsis 
tnaliana 


F23N19.12 


93 


39 


1307 


gil5277926 


Homo sapiens 


Similar to R1KEN cDNA 2700064H14 
gene 


1160 


99 


1 ic\n 

13U / 


•o nm 1 1 to 
glzUU / 1 Uo 


Mus museums 


X> TT?"C?XT /-%"P\XT A T7AAA^/1XJ1 A 

K1K±SJN CJJNA z /UUUo4rll4 gene 


1 1 /CA 
1 lOU 


on 


1307 


AAM90766 


Homo sapiens 


HUMA- Human 

immune/haematopoietic antigen SEQ 
iij JN(J: io3Dy. 


719 


71 


1308 


AAB93936 


Homo sapiens 


HELI- Human protein sequence SEQ 
1U JNU: 1 


813 


95 


1308 


AAM00936 


Homo sapiens 


HYSE- Human bone marrow protein, 

oxi^ 1U INU: 41z. 


313 


52 


1 AC 


glZUDUZ30 / 


nomo sapiens 


cnaperone~/vr5L^ 1 -HKe 


1 1 1 


_>z 




A A/1 f^l Q 


riomo sapiens 


r>LJiJ/\- Human lipoprotein precursor 
protein signal peptide 41 SEQ ID 

1NU.Z, 


1 n 1 0 

1 / iy 


1 AA 
1UU 


1309 


AAM95270 


Homo sapiens 


HUMA- Human reproductive system 
reiateo antigen o-ea^ iu inl^. jyzo. 


583 


100 


1 j\Jzf 


A A w^n^^n 

AAW jUOjU 


riomo sapiens 


vjnivi 1 Human secretea protem cione 
DgZHy 1 protein. 


Z> 1 J 




1310 


AAE15854 


Homo sapiens 


ELIL Human SEZ6 mature protein. 


75 


24 


1 'x 1 n 

IJ 1U 


A A T7 1 


riomo sapiens 


liJLii^ Human biiZvO protein. 


13 


Z4 


1.3 1U 


A A T TS 1 Q'7< 


riomo sapiens 


ijnc i - Human secretea protem 
SECP2. 




OA 

Z4 




glZ>OU4yj 


xCattus 
norvegicus 


ribosomal protein L24 


TO/1 

/y4 


1 AA 
1UU 




gl-J 1 jZoZ J 


J30S taurus 


riuosomai protein LyJU 


/y4 


1 AA 
1UU 


1311 


gi292437 


Homo sapiens 


ribosomal protein L30 


794 


100 


1:5 1Z 


glZ 1 U04 / / 1 


Drosopnila 
melanogaster 


D T T /Z 1 /I /C7 

Kl-16146 Vp 


521 




1312 


gi 18676554 


Homo sapiens 


FLJ00174 protein 


391 


34 


13 lz 


giloloz/95 


Drosopnila 
melanogaster 


GHl2489p 


365 


33 


ion 

1313 


AAYo4592 


TT * 

Homo sapiens 


T TX TTT T T A * * J 

UNIW Amino acid sequennce of a 
human artemin polypeptide. 


104 


31 


1313 


ffi 1742932^ 


xvclio WJIlla. 

solanacearum 


PROTEIN 




9Q 


1313 


AAW87504 


Homo sapiens 


SIB I- Human N-methyl-D-aspartate 
receptor subunit encoded by clone 
NMDA24. 


93 


31 


1314 


gi927637 


Anthocidaris 
crassispina 


dynein intermediate chain 2 


604 


.29 


1314 


gi6594639 


Homo sapiens 


dynein intermediate chain DNAI1 


602 


28 


1314 


gi2 10405 11 


Homo sapiens 


dynein, axonemal, intermediate 


600 


28 
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pOl > / pCpLlU.C, 1 






1315 


gi20530222 


Homo sapiens 


BLOCK 25 


1830 


100 


13 15 


gll 161 1j4j 


bus scrota 


aiaose i-ep unerase 


1 670 


89 


1315 


AAM40101 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

MO 19/1 A 


1366 


100 


1316 


AAM60715 


Homo sapiens 


MOLE- Human brain expressed single 
exon piooe eiicuucu. pujiciu uLvv,/ lu 
NO: 32820. 


2351 


100 


1316 


giDy l /ooo 


z,ea mays 


cxtensm-iiKc proicm 


148 


94 


1316 


gll /4yo42 


Y arrowia 
upoiytica 


ceil wan proiem 


1 20 


35 


1317 


AAM25483 


Homo sapiens 


HYSE- Human protein sequence SEQ 


975 


99 


131/ 


a "did ill <;n 
Ar5Jb> 1 IzjU 


riomo sapiens 


n I oJC(- XxUlIldll l/L'-ga.lllliici icocpiui 
rinmnlnanp TD NO' 1 620 

ll(JllWJHJg,UC 5 O-L^V^ LL' l>IUilUiiu, 


795 


100 


1317 


AAG01757 


Homo sapiens 


GEST Human secreted protein, SEQ 


612 


99 


1318 


gil5 145797 


Sus scrofa 


basic proline-rich protein 


532 


29 


1 1 1 o 

13 lo 


gljy 1 /ODD 


Zj&zl mays 


extensm-iiKc proicm 


475 


25 


1318 


gi5305335 


Mycobacterium 
tuDercuiosis 


proline-rich mucin homolog 


428 


25 


1319 


AAU 11265 


Homo sapiens 


BODE- Human zinc finger protein 5 1 . 


1576 


64 


13 19 


gil5y29 /3 / 


Mus muscuius 


oimiiar to zinc linger proiem / 


1 SS7 




1319 


AAB93576 


Homo sapiens 


HELI- Human protein sequence SEQ 

1JJ InU.IZVoo. 


1518 


56 


1320 


AAM96103 


Homo sapiens 


HUMA- Human reproductive system 
reiateu anngen ojcv^ iu invj. h-/oi. 


516 


92 


1320 


gil7223819 


Homo sapiens 


transcription cofactor vestigial-like 2 
protein 


222 


47 


1320 


AAM94333 


Homo sapiens 


HUMA- Human reproductive system 
leiaxeo antigen oiiv^ ii-^ zyyi. 


158 


35 


1 n 1 
1 Jzl 




riomo sapiens 


nm_(i- n.uiiid.11 piuiciii ocquciiL'C 


1^99 


100 


1321 


gi21483586 


Drosophila 
meianogasier 


SD07343p 


94 


27 


Uzl 


rri 1 C7nn7 1 1 

gll o /UU / Lj 


urosopnna 
meianogasier 


•ff o nor* t*i t^tirvn C4i p Arpi*tTPc c f»r A i~m 
LlalloCllUllvllcll L/UlCjJlCooLll /lliu 


94 


27 


1 j/Z 


A ARQ477A 


nomo bapieiio 


T-TT7T T- TTiiman "nrntpin QpniiPTiPP SIT^-O 

jnCLr-L 11U111CU1 UlL/LCiil o^L^Ll^llV^ OXjV^ 

TD TsTO-1 ^862 


3838 


99 


1 jZZ 


ail 84701 17 

gl 1 OH- / 7l j / 


IYIUo IIlU.oOU.lUo 


n n cr i o tn rvH ti 

dliglClllvi till 


136 


25 


1322 


gi2754696 


Gallus gallus 


high molecular mass nuclear antigen 


134 


21 


1 

1 JzJ 


gii z>yzyuoi 


xiomo sapiens 


gene 


1371 


100 


1 jZj 


gll OOUJOjH 


jvius muo cuius 


RTT<rF7sr rDNA 1 S00001 K 1 7 f*ene 


1076 


80 


1 191 


criQQQ^9Q< 


rxomo odpieiio 




1056 


100 


1324 


gi292443 


Homo sapiens 


ribosomal protein S20 


472 


84 


1 194 


p-i57720 


T?pttii<2 ratlin: 


ribosomal Drotein S20 ( AA 1-119) 


472 


84 


1324 


gil7932978 


Homo sapiens 


ribosomal protein S20 


472 


84 


1325 


AAB 82946 


Homo sapiens 


UYNY Human androgen receptor 
trapped protein (ART). 


661 


30 


1325 


AAR23963 


Homo sapiens 


SNOW AFP-1 (Ala 2460 Val). 


661 


30 


1325 


AAR23962 


Homo sapiens 


SNOW AFP-1. 


661 


30 


1326 


gi 12002024 


Homo sapiens 


brain my038 protein 


371 


100 


1326 


gil0178219 


Arabidopsis 
thaliana 


DNA-binding protein-like 


97 


41 
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1326 


gil9171280 


Encephalitozoon 
cumcuh 


similarity to HYPOTHETICAL ZINC 
FINGER PROTEIN YKF9 YEAST 


92 


31 


1327 


gi6566147 


Drosophila 
melanogaster 


large Forked protein 


193 


30 


1327 


AAU20496 


Homo sapiens 


HUMA- Human secreted protein, Seq 
ID No 488. 


192 


41 


1327 


A AT TO O 1 '—t A 

AAU28174 


T T * 

Homo sapiens 


HYSE- Novel human secretory protein, 
Seq ID No 343. 


192 


A 1 

41 


1328 


A A T*"» r\ A 'I A C 

AAB94745 


T T * 

Homo sapiens 


xxx™»x x xx » • n"n/\ 

HELI- Human protein sequence SEQ 
ID NO: 15792. 


t-j ^ 

6782 


99 


"1 1 o o 

1328 


gi2l392l02 


Drosophila 
melanogaster 


RE22982p 


867 


27 


1328 


ABB89764 


Homo sapiens 


HUMA- Human polypeptide SEQ ID 
NO 2140. 


624 


97 


1329 


gil5991574 


Homo sapiens 


eferin 


825 


53 


loon 


■ 1 no/I /i 

gil 7945944 


Drosophila 
melanogaster 


Kh46851p 


337 


28 


1329 


gi2961133 


Drosophila 
melanogaster 


nuclear fallout 


337 


28 


1330 


AAM52332 


Homo sapiens 


BIOW- Human zinc finger protein 72. 


3635 


99 


1330 


gi 14456629 


TT _ • 

Homo sapiens 


dJ54B20.2 (novel KRAB box 
containing C2H2 type zinc finger 
protein) 


3619 


100 


1330 


gil5081398 


Homo sapiens 


krupp el-like zinc finger protein 


1823 


55 


1331 


gil39383l7 


TT * 

Homo sapiens 


Similar to zmc finger protem 202 


416 


43 


1331 


AAM40994 

* 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 5925. 


393 


43 


1331 


A A A TO AO 

AAM39208 


T T 

Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2353. 


393 


43 


1332 


gi56952 


Rattus 
norvegicus 


put. preoptic regulatory factor-2 


381 


98 


1332 


gi2 190355 


Dictyostelium 
discoideum 


racGAP 


192 


30 


1332 


gi6226761 


Dictyostelium 
discoideum 


class VII unconventional myosin 


190 


30 


1333 


gi21359654 


Homo sapiens 


LGI4 


2398 


93 


ion 

1333 


_, • aah c /T O'O 

giz0975688 


T T * 

Homo sapiens 


leucme-nch glioma inactivated protein 
4 


2398 


/■"V 

93 


1333 


gi20975756 


Mus musculus 


leucine-rich glioma inactivated protein 

A 

4 


2169 


84 


1334 


gil 5020655 


Homo sapiens 


ATP/GTP-binding protein 


5570 


99 


1334 


AAB94270 


T T * 

Homo sapiens 


T TT — 1 T T T T i 1 <— i T" « 

HELI- Human protem sequence SEQ 
ID NO: 14689. 


4071 


100 i 


1 O O /I 

1334 


giz 14283 64 


Drosophila 
melanogaster 


GM14375p 


267 


21 


1335 


gil 1493544 


Homo sapiens 


PRO 1460 


412 


100 




giiDu /y^y i 


liomo sapiens 


oimilar to R1KEN cJJJNA lzUUU14H14 
gene 


1 one 


1UU 


1336 


gil3879544 


Mus musculus 


Similar to RIKEN cDNA 1200014H14 
gene 


933 


88 


1336 


gil4043175 


Homo sapiens 


Similar to RIKEN cDNA 1200014H14 

gene 


920 


100 


1337 


gi20988758 


Mus musculus 


RIKEN cDNA 1700030A21 gene 


679 


84 


1337 


A AB 67449 


Homo sapiens 


INCY- Amino acid sequence of a 
human chaperone polypeptide. 


566 


100 
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133 / 


giy /d / / o / 


Araoiaopsis 
thaliana 


tetramcopepiiae repeat protein z-iikc 


1 




1338 


glloZCO J 1 J 


Mus museums 


T TY1 
.L1A1 


JL JUt 


04 


1338 


gil8265513 


Gallus gallus 


LIX1 


1203 


83 


1338 


AAM93000 


Homo sapiens 


HUMA- Human digestive system 
antigen bbQ ID JNU: zJ4y. 


753 


87 


1339 


gil4024367 


Mesorhizobium 

1 X ' 

loti 


weak similarity to 

ubiquinone/ menaquinone bio synthe sis 
metnyl transrerase 


185 


32 


1339 


gi20804035 


Mesorhizobium 
loti 


HYPOTHETICAL PROTEIN 


148 


29 


1339 


gi2 1107889 


Xanthomonas 
axonopodis pv. 
citri str. 306 


methyltransferase 


115 


33 


1340 


gi559237 


Petroselmum 

• 

enspum 


tyrosine-ricn nyaroxyprolme-ricn 
glycoprotein 


on 


zo 


1340 


gi44U63y3 


Jb> os taurus 


aitierentiation emiancing iactor i 


oo 


jj 


1340 


gil7939907 


Pseudorabies 
virus 


UL36 protein 


84 


32 


1341 


ABB1 1819 


Homo sapiens 


rlYoii- riuman secreted protein 
homologue, SEQ ID NO:2189. 


Oil A 


Q7 


1341 


a \WTcr\~ic\o 

AAWo0395 


Homo sapiens 


(jrrijvi Y A secretec protein encoaea oy 
clone cwl543 3. 


loo / 


OR 


IT/11 

1341 


gl7547Uz9 


Homo sapiens 


wAr^-iike protem 


jt / 


JJ 


1342 


AAB50867 


Homo sapiens 


ZYMO Human zalpha3 1 . 


701 


100 


1342 


AAB54229 


Homo sapiens 


HUMA- Human pancreatic cancer 
antigen protein sequence i^Jbv^ lu 


1C\'\ 

/VI 


1 nn 

1UU 


1 q /i i 

134Z 


A A T3 1 C\T21 

AAJdI KjZdZ 


riomo sapiens 


^jiiivi i xiuman ietai piacema proiem 
fragment AP 1 62 1 i. 




07 


1344 


AAY /3J40 


xiomo sapiens 


iini^ x - iri i ivivi cione oiyoyy protein 
sequence. 


1 

ijjj 


uu 


1344 


A A "D/1 101 1 

AAr>4j>y 1Z 


xiomo sapiens 


xiuiviA- numan cancer associatea 
protein sequence SEQ ID NO: 1357. 






1344 


gi498721 


Homo sapiens 


zinc finger protein 


1047 


51 


1345 


gil3516895 


Homo sapiens 




0 1 

Z 1 00 


1 nn 

1UU 


1345 


gil3516897 


Mus musculus 


MLZE 


875 


47 


1345 


AAB93904 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 13862. 


360 


30 


1346 


ABB 12463 


Homo sapiens 


HYSE- Human bone marrow expressed 
protein bliQ ID JNU: 3Uz. 


C C f 

556 


1 nn 
100 


1346 


AAM / /500 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 

11J JNU: 3 /oUO. 


34Z 




1340 


AA1V104 /3U 


riomo sapiens 


iviU-Lxi- riuman Dram expressea smgie 
exon probe encoded protein SEQ ID 
NO- ^6S^S 




i no 


1347 


gil2751092 


Homo sapiens 


PNAS-123 


306 


98 


1347 


gi9988480 


Ursus maritimus 


luteinizing hormone receptor 


72 


31 


1347 


gil 196882 


Ovis aries 


luteinizing hormone receptor 


69 


31 


1348 


gil2018147 


Chlamydomonas 
reinhardtii 


vegetative cell wall protein gpl 


227 


27 


1348 


gil5 145793 


Sus scrofa 


basic proline-rich protein 


221 


28 


1348 


gi6523547 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


220 


30 



wo 
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Jilt ID 


opecies 


j^esci lpiion 


score 
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identity 


1349 


A A DO^^TO 

AAJb>yj>U3y 


riomo sapiens 


riCji-ii.- riuman proiem t>cc|uciiuc oxj/V^ 
ID NO: 16797. 


21 87 


100 


134y 


ryiQO^l OH/1 

giyyo3oU4 


Homo sapiens 


zinc linger proiem ZjiNrzoo 


1 0^8 


46 


1349 


AAM39130 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

IN w Z//J. 


1015 


46 


13DU 


A A A/T/1Q 1 ^4 


nomo sapiens 


JDv^/JL/J-/ - ri.Li.Illa.ll tw^/ILLUlllC' U^/l 

compound core protein 1110.23. 


471 


100 




rri 1 Qfl^ Q <\9 


urosopmia 
melanogaster 


reverse uansciipLa&c puiypiutciii 


98 


^5 


13 DU 


glZ4ZV 115 


Leisnmama 
major 


F»T^T1 • T ^ 1 ^0 9 

LJLJi. 1 , LjIO^.Z 




27 


1 33 1 


m* 1 Q9 <a 
gll y3 D43DU 


ivius mus cuius 


biiiiiiai iu bic iiuniuiugy uiicc ^oxij y 

and cysteine rich domain 


1 897 

1 07 / 


92 


13D 1 


gll /yyDOO 


ivius mus cuius 


blaU 


788 

/ uu 


47 


1351 


AAW59642 


Homo sapiens 


SHIO Amino acid sequence of human 
oidt pro ic m. 


775 


45 


1352 


AAU17301 


Homo sapiens 


HUMA- Novel signal transduction 
pain Way proiem, oet| llj odo. 


1347 


100 


1352 


AAE 11776 


Homo sapiens 


INCY- Human kinase (PKIN)-IO 
protein. 


1204 


100 


1352 


A A T T1 TJ rv/A 

AAU 17300 


Homo sapiens 


xlUJviA- JNovei signal transauction 
patnway pro tern, oeq iu qod. 


A1 R 
Ol o 


1 fin 

1UU 


1353 


gi3986121 


Rattus 
norvegicus 


synaptopodin 


78 


28 


1353 


gil53373 


Streptomyces 
griseus 


ORFC 


78 


31 


1353 


gi5002552 


Streptomyces 
griseus suosp. 
griseus 


NonC 


78 


31 




gl 1 yj>D330U 


ivius museums 


rvlXVJ_/i N N /A. ^oiwwuvvjruj gene 


260 


56 


1 1 <A 

13 j4 


AAU 1 J o / / 


riomo sapiens 


n i OD- ii urn a ii poiypepiiuc ony ij-^ 

TsJO 9776Q 
in w jl f f \jy. 


J> y 


54 ^ 


1354 


AAB58207 


Homo sapiens 


ROSE/ Lung cancer associated 

-»-\ r-vl t /t"\p»ttH /"I p cpnii pti r* f» ,Q"Pf^i TTi So, S 


259 


54 




ADDUu / OO 


nomo Sapiciib 


"RODF- Human T AMC1 69 85 SEO ID 
NO 2 


588 

U 


32 


1355 


AAM79102 


Homo sapiens 


HYSE- Human protein SEQ ID NO 
1764 


588 


32 


1355 


gi20071167 


Mus musculus 


Similar to Perl interacting protein 


558 


30 


1 1^6* 

IjjO 


cril R9Q99ZL 


X-X/"\t*v* cotMono 

norno &apiens» 


i"il"\r"\c/'vmci1 ■nrrttfiTi T ^7 
llL/*JoVJlllal |J1ULC111 JjJ / 


510 


96 


IjJO 


gl J / 1 Z 1 


rvclILUS 

norvegicus 


llDOsUIllal piOLClll 1 


510 


96 


13 JO 


gl40 1Z3Z 


riomo sapiens 


rioosomai proiem i^o / 


S1 0 


96 


1 9 


rri 9 AO C C < 0 < 

gizuyooooo 


riomo sapiens 


uiviiv i -iiKe lamiiy \^ 1 




100 


1357 


AAM42520 


Homo sapiens 


HUMA- Human kidney related 

^^Ur«Qr,f;^a OTTO TTV TvTO aOO 

poiypeptiae oJcv<; 11^ ~>oy. 


386 


89 






1 1 VJ 1 1 IU ocijJlCllSi 


WTTIVTA- Human excretorv related 

llUiVlii 111U11U11 uAvlvlul V 1 Vltl 

polypeptide SEQ ID NO 442. 


386 


89 


1358 


gi339783 


Homo sapiens 


slow skeletal muscle troponin T 


1205 


100 


1358 


gi546023 


Homo sapiens 


troponin T slow isoform; TnT slow 
isoform 


1205 


100 


1358 


gi4056562 


Homo sapiens 


slow skeletal muscle troponin T 


1205 


100 


1359 


AAU16178 


Homo sapiens 


HUMA- Human novel secreted protein, 
Seq ID 1131. 


2603 


98 


1359 


gi2 1464470 


Drosophila 


RH63657p 


487 


37 
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melanogaster 








1359 


AAM65817 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein bi}Q 

11J invj: Z01z3. 


389 


100 


1360 


AAG89164 


Homo sapiens 


GEST Human secreted protein, SEQ 

TTpv 7vT/^\. TO/I 

J.1J JN(J: zo4. 


1774 


99 


1360 


ABA05334_ 
aal 


Homo sapiens 


MILL- Human fucosyltransferase 
family member 32132 coding 
sequence. 


853 


42 


1360 


AAM47905 


Homo sapiens 


MILL- Human fucosyltransferase 
family member 32132. 


853 


42 


1361 


gil4529886 


TV /f _ 1. 

Mus museums 


DM14504.1 (novel protein) 


1 1 "~l £L 

1 176 


47 


1361 


AAU15865 


Homo sapiens 


HUMA- Human novel secreted protein, 
Seq ID 818. 


1091 


46 


1361 


AAE02779 


T T ■ 

Homo sapiens 


CURA- Human PRO-C-MG.45 protein 
encoded by DNA-C-MG.45-1776 
cDNA clone. 


1080 


A 

46 


1362 


gi2323287 


multiple sclerosis 

associated 

retrovirus 


polyprotem 


3104 


0 1 
81 


1362 


gi|331995|g 

b|AAB0309 

1 1 1 
1.1| 


AKV murine 
leukemia virus 


gag-pol polyprotein (tag amber codon 
at 2250-2252 inserts Gin in Mo- 
MuLV) 


1368 


37 


1362 


gi|535518|g 

1 I AAA yT^7 

b|AAA9267 
n 1 1 

y.i| 


Murine leukemia 

* 

virus 


gag-pol polyprotein 


1366 


36 


1363 


AAB92554 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID J\IO:10/41. 


1062 


100 


1 1 o 

1363 


AAM91 124 


Homo sapiens 


T TT T\ If A TT, . „ „ 

HUMA- Human 

immune/haematopoietic antigen SEQ 
id jnvj: lo / 1 /. 


4oo 


y I 


13o3 


m 1 1 /I 1 7on 

gii 141 /yu 


Drosopniia 
meianogasier 


nonmuscie myosm-ii neavy cnam 


111 


Ad 


1364 


gil9913468 


Homo sapiens 


similar to zinc finger protein 


519 


98 


1 j04 


/\/\ivi / oy fH 


riomo sapiens 


ri i oxi- riuman protein oiiy id inw 
1636. 


ill 

DZ 1 


OH- 




gijo /yj /o 


riomo sapiens 


zinc linger 4 1 


dz. 1 


Oh 


1365 


gil5488932 


Homo sapiens 


Similar to RIKEN cDNA 1700037B15 
gene 


1002 


100 




gi 1 /3oj /o / 


Mus museums 


Kir'oUi-iiKe protein 




yj 


1365 


gi 17432247 


Mus musculus 


SMHS1 


933 


93 


1366 


AAM39808 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 1955. 


105 


24 


1366 


AAB47142 


Homo sapiens 


INCY- CDIFF-24, Incyte ID No. 
15400J3CD1. 


101 


24 


1366 


AAM41594 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 


98 


24 


1367 


gi6409379 


Homo sapiens 


zinc finger protein ZNF229 


2299 


100 


1367 


gil0864174 


amino acids 1- 
420] [Homo 
sapiens 


ZNE229 


2299 


100 


1367 


AAB95278 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 17486. 


1973 


48 


1368 


AAG74304 

• 


Homo sapiens 


HUMA- Human colon cancer antigen 
protein SEQ ID NO:5068. 


933 


100 
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ID 
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S 

still c 


Percent 

iiitii ti i y 


1 1 £Q 

13oo 


A A flfl/l fM o 
AALrU4U4V 


Homo sapiens 


LtHo i Human secreiea protein, ony 
ID NO: 8130. 


HO / 




1368 


gil61292 


Loligo pealei 


neurofilament protein 


96 


22 


i3oy 


^Till/I nm 
glZ1314yU/ 


Homo sapiens 


DVD T\J nnn^inmrr A "D A T7 1 1 i Ira Drntdin 

YK1JN -containing /\Jr/\Jri-iiKe rrotein 

7 


H-ZOZ 


oy 


1369 


AAE07514 


Homo sapiens 


MILL- Human PYRIN- 1 protein. 


2156 


49 


1369 


gil5l8z339 


Homo sapiens 


JrY R1N -containing ArAr 1 -liKe protem 

-i 
l 


Z 1 JO 


4^ 


1370 


gil3 183793 


Homo sapiens 


CECR2 protein 


6203 


97 


1370 


AAM70233 


Homo sapiens 


MOLE- Human bone marrow 
expressed prooe encoaea protem oiiv^ 
ID NO: 30539. 


1 J /z 


1UU 


1370 


AAM67oOo 


Homo sapiens 


MULh- Human bone marrow 
expressea prooe encoaea protein oxik^ 

1JL/ 1NU. Z O 1 OH- . 


1 <T) 

1Z> /z 


1UU 


1 1*7 1 
1.3/1 


gllUoUUoDo 


nomo sapiens 


axninopepnaasc r> 




yy 


1371 


gil5082509 

• 


Homo sapiens 


arginyl aminopeptidase 
^airtiriopepiiaase u j 


3437 


98 


1371 


gil0933784 


Homo sapiens 


aminopeptidase B 


3433 


98 


1372 


AAE 15250 


Homo sapiens 


INCY- Human RNA metabolism 
protein-13 (RMEP-13). 


2339 


99 


1372 


gizU072721 


Mus musculus 


KlKh-N CJJJNA 31UUUU4rZz gene 


ZZZJ> 


Q 1 

y i 


1372 


gi2098575 


Homo sapiens 


F25451 2 


1602 


99 


1373 


gi6841246 


Homo sapiens 


HSPC298 


528 


90 


1373 


gi2384748 


peach rosette 
mosaic virus 


T> "VTA -1 1 j • 

RNA1 polyprotem 


71 


33 


1373 


gi|6841246| 
gb|AAF289 
/o.l|Ar lol 
416 1 


T T * 

Homo sapiens 


HSPC298 


coo 

528 


90 


13/4 


gi53zl 13 


Caenorhab ditis 
elegans 


homeotic region most like 
HMPB_DROME: homeotic 
proDOScipeaia protem 


oil 
Zl 1 


9 1 
Zl 


1374 


gi 18447198 


Drosophila 
melanogaster 


GH09355p 


209 


19 


1 in a 
13/4 


gloyo41oU 


Streptococcus 
cristauis 




srpA 


iyu 


1 n 
i / 


ID / J 


m*9 1 /l^n^lQ 
glZ L^DUDJO 


urosopiuia 
melanogaster 


T X>1 1 Z£A-n 

L,r 1 lD04p 


ooz 


jj 


1 

13 / J 


AA1V141UUU 


Homo sapiens 


H Y oh- Human poiypeptiae o-ti^ iu 
NO 5931. 


9R9 
ZoZ 


97 
Z / 


1 J / J 




Homo sapiens 


H Y ob- Human polypeptide oxi^ iu 
NO 5930. 


9Q9 
ZoZ 


97 

Z / 


1376 


AAG663 1 1 


Homo sapiens 


BIOD- Human zinc finger protein 46. 


2311 


99 


1 T7Z" 

13/6 


gil9548796 


Mus musculus 


multmnger protein mKr2 


zzuy 


Q < 


1376 


gi4567180 


Homo sapiens 


BC37295 2 (partial) 


1765 


64 


1 j / / 




Homo sapiens 


HUJViA- Human novel secreiea proiem, 
SEQ ID 201. 


j\j /y 


OR 
yo 


1377 


gil9263612 


Mus musculus 


Similar to expressed sequence 
AI413166 


1862 


87 


1377 


AAM67519 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 27825. 


808 


100 


1378 


ABB05653 


Homo sapiens 


BODE- Human DNA binding protein 
RFX2-89 SEQ ID NO:2. 


3952 


93 
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1378 


AAB94287 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO:l4728. 


3143 




1378 


gil 1994733 


A 1*1 * 

Arabidopsis 
thaliana 


contains similarity to cell wall-plasma 
membrane linker 
prolein~gene ia:MrvAZ3.j 


1 AH 

34 / 


ZD 


1379 


AAloooz /__ 
aal 


Homo sapiens 


JVllJLrl^- rluman protein Kinase 
poiypepuae Z3j4o coaing sequence. 




X\J\J 


1379 


AAG65766 


Homo sapiens 


MILL- Human protein kinase 
poiypeptiQe z3j4o. 


3993 


100 


1 IHCi 


F.: 1 oil a 
gll 5 / 34 


Lriycme max 


jLyiN/v-Qireciea. r\j.>Jr\ poiynieiase 


it / 




13oU 


rriQAHOA 

glo'r / /Z4 


riomo sapiens 


meinyiinioaQenobine priobpriuiyiabe 


i tut 




1380 


gil31 11476 


Mus musculus 


methylthioadenosine phosphorylase 


1377 


90 


1 O OA 

13oU 


gi!3Z / /yyyJ 


ivius musculus 


ivirvJc/iN cuiNA i3uuuiyizi gene 


i in 

13 l f 


on 


1 TOI 

i3i5 1 


glIZw)o4o41 


riomo sapiens 


rl 1 U30 


9.1 1 


04 
^t 


1 1Q1 

13ol , 


gllZDo4o3y 


riomo sapiens 


xl 1 U30-lo^J 


Oj 1 


04 


1381 


AAG03594 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID JNiJ. /O /Z>. 


403 


88 


1382 


gil2584839 


Homo sapiens 


HT036-ISO 


986 


85 


1382 


gil258484l 


TT * 

Homo sapiens 


HI 03 6 




o3 


1382 


AAG03594 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 7o/D. 


381 


70 


1383 


gil2584839 


Homo sapiens 


HT036-ISO 


916 


99 


*i o ft o 

1383 


gil258484l 


Homo sapiens 


HT036 


73y 


yy 


1383 


gil7427028 


Ralstonia 
solanacearum 


CONSERVED HYPOTHETICAL 
PROTEIN 


367 


46 


1384 


AAM00883 


Homo sapiens 


HYSE- Human bone marrow protein, 
SEQ ID NO: 359. 


574 


99 


1384 


AAM00770 


Homo sapiens 


HYSE- Human bone marrow protein, 
SEQ ID NO: 133. 


573 


99 


1384 


gil7862960 


Drosopmla 
melanogaster 


SD05477p 


349 


54 


1385 


AAB95751 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 18660. 


173 


32 


1385 


AAY16782 


Homo sapiens 


GEMY Human secreted protein (clone 
cg426 8). 


142 


50 


1 1 o c 

1385 


AAM86749 


TT" ■ _ 

Homo sapiens 


TTT TA yT A T X ^ 

HUMA- Human 

immune/ naemat op oie tic antigen oii^ 

lXJ 1NL^.1434Z. 


1 1 Q 

1 lo 


3Z 


1386 


gil9879436 


Homo sapiens 


COASTER 


5431 


95 


1386 


AAB92536 


T-r • 

Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO:10701. 


ZOoZ 


1UU 


1 1 O f 

1386 


AAU16460 


TT 

Homo sapiens 


HUMA- Human novel secreted protein, 
beq ID 1413. 


1 0^/1 

loo4 


yy 


1387 


gi6 103649 


Homo sapiens 


F-box protein FBX10 


2878 


99 


1387 


AAY83080 


TT " 

Homo sapiens 


UYNY F-box protein FBP-12. 


9o3 


1 nn 
1UU 


1387 


gi6 164739 


Homo sapiens 


T~> 1 i • T->1 1 

F-box protein FbxlO 


963 


1 nn 




gll!>UoZ4Zo 


Homo sapiens 


oimiiar to KiivxiiN cujna zoiuuddt 11 
gene 


1 RAO 


QQ 


1388 


gil3435795 


Mus musculus 


Similar to RIKEN cDNA 2810055F1 1 
gene 


1661 


88 


1388 


AAU27647 


Homo sapiens 


ZYMO Human protein AFP301973. 


673 


99 


1389 


AAM38689 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 1834. 


2085 


66 


1389 


gi38032 


Homo sapiens 


ZNF43 


2085 


66 


1389 


gil6306806 


Homo sapiens 


zinc finger protein 43 (HTF6) 


2085 


66 
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1390 


AAB95279 


Homo sapiens 


HELI- Human protein sequence SEQ 
TD NO- 17488 


3194 


100 


1390 


gil4329547 


Homo sapiens 


dJ1104A8.1 (novel protein) 


629 


99 




gloZ JU 1 O 1 


uj. o b op mid. 
melanogaster 


D-Titin 

Jl-/ - 1 1 1111 


160 


20 






JJ-OIIIU bdpiCllo 


PVTA- T-Tiimnti Virjf*cin irintnt" "nrotf*in 

Y_> X X w X X Llll.lu.il ivlUC/OHl 111VJ IVJ1 JJ1 v_J IV Hi- 

HsKip3b motor domain. 


972 


85 


uy i 


gllZoOZOU / 


ivius muscuiub 


Irinpcin cnn t*"Fq m 1 1 "\ / ntAtpi n 1 Q A 
ivlIlCoIll blipCllcllllliy piUlClll 1 


663 


94 


uy i 


A 1 A 
AJt5£5U /<+ 1U 


nomo sapiens 


(~*"S/Tf\ "PTllTYlCin l/'inpoin tTK^tr^T t>t/*^ "ff^l fl 
I 1 vJ- XT.Ullld.il JsJLIlCblll I11UHJX piULCXll, 

XXiXVlU 


646 


48 ! 


1 ^07 
1 jyz 


tri 1 Q4R4 1 
gliy^foH- 1 J7 


IVXllo lllUbL/UXUo 


Similar tn RTKFN rDNA S410428K15 

OllIlllCll IAJ XvXXVX_/l>l V^l-' 1 > / v JtJUtLOIxlJ 

gene 


807 


79 


1 709 

I s>yz. 


A A RGA&47 


nomo bapiciib 


TTThT T_ T-Tumcin •nrr^tfMn cpniiprirp ^IT^O 

XXi_/-l— (1 XXLlIllCtil UHJICXLI OCV^U.C/IIUC' O !_/ V^/ 

ID NO: 16016. 


401 

I W 1 


98 


1 J7Z 


gUO / JJOJ 


^aCIiUlllaDUXLlo 

elegans 


L/Z,UOJ . Z, 


121 
i i 


29 


1 ^Q^ 


CM 1 ^449786 


iV/lnc miicfn lnc 
IVXLlo lllliolslilLlo 


l~Jrf*tnTin1 fi 


111 


70 


1 ^0^ 


oil 7S 1 1 70Q 
gi i / j 1 1 / \jy 


T-T r\ m r\ c ar^l F*"n C 
IlVjlllU otlUlClib 


L1W Wll 1 CgUldlV^U '-'y v/UlilU 1 } CI 


765 


66 


1 ^0^ 

i d y d 


cn'1 ^4497R4 


i. ivjiiivj oauiciio 


DRCTNNR1 A 


752 


66 


1 ^04 


n.D 1J \J O ~) \J\J 


11VJ111VJ odUlbllo 


CtPN7 Human eTT* 1 ^ amino acid 
sequence. 


1785 


95 


1 ^94 


glx-J J 1 JOU 


XXUllivJ OO-LJlt/llO 


translation initiation factor eTF3 r>40 
subunit 


1785 


95 


1394 


gil2653235 


Homo sapiens 


eukaryotic translation initiation factor 

^ <jiiT^iinit ^ ^ era mm a 40VT~)^ 

J) ? O LIU LAI 11 L J ^gdllliild, '"T\J±\A-S J 


1785 


95 


1395 


gi3 169261 


Mus musculus 


T-box transcription factor 


1584 


98 




m*7 1 9C2£7 

glj IZojoZ 


ivius muscuius 


IVAlTll. DaIt 1 


X J7 O 


98 


IjyD 


m'9 1 440^ 40 
glZ iHr^-tyDH-KJ 


jLydiuo rcrio 


X - UUA UUllLa-XlXlllg 11 aXloLvl ip LlvJll XaLvlUl 


1 366 


85 


1396 


AAB95361 


Homo sapiens 


HELI- Human protein sequence SEQ 

TFl NO* 1 7667 

1JJ 1NU. 1 / DO / . 


702 


99 


1396 


gil9071496 


Pseudomonas 
syrmgae pv. 
maculicola 


type III effector HopPmal 


100 


31 


1396 


gi7140837 


Streptomyces 
seouiensis 


dihydrolipoamide acetyltransferase 


98 


34 


Ljy 1 


glizOJO 1 OD 


riomo sapiens 


zinc imger proiem zoo 




i no 

1 UV/ 


1397 


AAZ35253_ 
aal 


Homo sapiens 


UYSH- Human leucine zipper protein 
gene (BMH) (CBLAYD06). 


2646 


99 


uy / 


A A VQ 0/IT7 

AA Y3z4z / 


riomo sapiens 


U i on- Human leucine zipper protein 
CBLAYD06, 


ZOHU 


yy 




gl4DooyZ 


riomo sapiens 


nomoiogous to mouse gene 
PC326:GenBank Accession Number 

A/TQ^^64 


904^ 


O / 


1398 


gi200241 


Mus musculus 


protein PC326 


1600 


59 


1398 


AAM73935 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 

XXv 1NW. JtZtl , 


818 


100 


1399 


gil7223819 


Homo sapiens 


transcription cofactor vestigial-like 2 
protein 


697 


95 


1399 


AAM94333 


Homo sapiens 


HUMA- Human reproductive system 
related antigen SEQ ID NO: 2991. 


156 


54 


1399 


gil809231 


Homo sapiens 


coded for by human cDNAs R76043 
(NID:g850725), R65857 
(NID:g838495) and H12868 
(NID:g877688) 


156 


54 
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C1?A 

ID 


JJL1 1 LxJ 




T)pcr»r , intif»n 

1/vOvI 11JI.IVAK 


s 

few? 

score 


Percent 
identity 


1 400 

1 *-r\J\J 




XJLU1J.1U OdJJlC/llo 


aYonpmal Hvnpiii lipaw rhain Ol^AHS 


5798 


97 


1400 


gil8449111 


Mus mus cuius 


axonemal dynein heavy chain 5 


5549 


92 


1 400 


cti 1 4T^S444 


XXUlllU cjcipiCllb 


o -v r^n F»"m !3 1 H\/tipiti lip^w pVinin R 
dA.uiiciiiai uyiiciii ucav y villain o 


4341 


70 


1401 


AAG03810 


Homo sapiens 


GEST Human secreted protein, SEQ 

TF> "MO* 7RQ1 
ixv in V-/ , / o y i . 


760 


92 


1401 


gil86800 


Homo sapiens 


ribosomal protein L12 


760 


92 


1 AC\ 1 
14U 1 


gi I / ^yu / Jl 


ivius muscuius 


llDODUIlld.1 piUlClil XvlZ 


7S9 


91 


1402 


gil7512582 


Homo sapiens 


Similar to RIKEN cDNA 2700059L22 
gene 


1611 


100 


1 a no 
14UZ 


A A A /f Q <C O Q 1 


xiomo sapiens 


rxUiViA- xiuman repruciucLive byaLcni 
related antigen SEQ ID NO: 3889. 




07 


1402 


gi 18447648 


Drosophila 
meianogaster 


SD06318p 


161 


22 


1 A f\1 

14U3 


A A T TO C\< 1 A 

AAUZUj 14 


riomo sapiens 


xiuiviA- xiuman secretea. proicm, oecj 

TTi "Mrk ^0^ 

JLi-J 1NO JUG. 


ZjJ 


^9 


1 AC\1 

1400 


A AV < s40 < %7 
AA Y 04LO / 


riomo sapiens 


VJTXlXXV-/ xYIlllIlU clUlLl bCmXCllL/C Ul 11UVU 

DNA cytosine methyltransferase 

X-/1N1VX X — '^A.. 


14^ 


1Q 

i y 


1403 


gi295671 


Saccharomyces 
« . 

Ccrcvisid.c 


selected as a weak suppressor of a 

mii-tarvt rt"F tVi*=» cnVinnit A ^40 rvfTlTVJA 
llllXluIlL Ul L11C bUUUlllL rW^>*T\J Ul X-'INxA. 

A f*y^ pnrlnnt T? T\T A nnlvmprflQp T anH TTI 

Lit/ t/l 1U Clll L ±\J. > A }J\JLy 11L\s1CIO\s ± a.l±\J. All 


143 


19 


1 404 


£^1 JL^J/JO^/Jl 


XXUlllU od-jJlC/ilo 


Similar to 7inr fincrpr nrntpin 

OllllllCll LU il/lllV-' lUlgUI VJLVJ ill iuU 


3615 


99 


1404 


gil 86774 


Homo sapiens 


zinc finger protein 


2626 


55 


1 404 
1^+ KJ^t 


glZ / J^J J S> 




jLiLS r y i Xj 


2S79 


54 


1405 


AAM95368 


Homo sapiens 


liUMA- Human reproductive system 

rplntpH antiopn 9FO TD TsTO- 4096 

1 ClclLCLl dllLlgCli OXjV< 1JL/ INw. tuZ-U. 


792 


100 


1405 


AAY30795 


Homo sapiens 


USSH A human trichohyalin (TRHY) 

Ul U IC111. 


128 


22 


1405 


gi292836 


Homo sapiens 


trichohyalin 


128 


22 


1406 


AAW41936 


Homo sapiens 


GEMY Secreted protein AS 1 52 1 . 


177 


100 


1 A C\C 

14U0 


AAW 4zUoz 


xiomo sapiens 


Oxiivi i i ne amino acia sequence oi ine 
AS 152 1 protein. 


1 77 
III 


1 oo 

1UU 


14U0 


giiyyuo /ou 


Aspergillus 

« 

nomius 


A T7T T? 
Ax'l^XV 


70 
/ y 


^ 1 

J 1 


1 407 


A A"RQ4*\41 


xiomo sapiens 


XXXIXjI - XlUllldll piULClll oCmXCHL/C OEy 

TD 7STO-1 S786 

X1_J" 1 N \_J . 1 Ji-OVJ. 


1 020 


100 

1 Uv 


1 407 

1 *tU / 




V-/U1 y 11C U IC1 1 1X111 

crliitaTTii mm 

gill LCUX1X V/ 14X11 

ATCC 13032 


T'r a n <j1 n ti fvn iniiHation faptm* 9 ^ CrTPa^lP, 1 
i laiioictiiuii miLicitiuii iciviui i vj xx ciow / 


124 


30 


1 407 


cri 190448^9 


v_sllla.Hl y UAJlllUllcio 

reinhardtii 


1/liAll UUA Xvl N ii. ll^llV/Ctov^ 


124 


33 


1 408 


cn'1 ^46QR01 
gllJHOi/OUl 


XXUlllU octpiciib 


LdllUClll X 1 V Hi llilgCio _ l UlULC/Xll 


1874 


100 


1408 


gil0834636 


Homo sapiens 


double FYVE-containing protein 1 


1874 


100 


1 A HQ 
14Uo 


gl JL QDOy i iy 


xiomo sapiens 


pnospnoinosiiiae-omuing proiem 


1 £74 

X O / *T 


100 

1 Uu 


1409 


ABB04291 


Homo sapiens 


UYNA- Human testis-specific protein. 


2586 


100 






xiomo sapiens 


testis-speciiic protein i or-iN y 


ZJOD 


1 00 
1 uu 


1 400 


A AIVTQS^RQ 

/VrVlvlj/ _) _> o y 


XXUlllU bapiCIlb 


T-TT T1V/T A — T-TnTnci'n Tprwrtrliintix/'p c\/ctprin 

XXLJIVX^T. - llLLllldll 1CJJ1 UL1L1U Ll VC oyolt/lll 

related antigen SEQ ID NO: 4047. 


1477 


98 


1410 


gil9168684 


Encephalitozoon 
cuniculi 


similarity to HYPOTHETICAL 
PROTEIN YANE SCHPO 


262 


33 


1410 


gi476061 


Saccharomyces 
cerevisiae 


YBR0834 


237 


29 


1410 


gi3237301 


Saccharomyces 
cerevisiae 


Vid24p 


237 


29 


1411 


AAM76193 


Homo sapiens 


MOLE- Human bone marrow 


238 


100 
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score 


jr erceiii 
identity 








expressed probe encoded protein SEQ 
ID NO: 36499. 






1411 


AAM63381 


Homo sapiens 


MOLE- Human brain expressed single 
exon probe encoded protein SEQ ID 
NO: 35486. 




1 fin 

10U 


1411 


gi5669894 


Human 

1 • „ o 

herpesvirus 8 


latent nuclear antigen 


z 10 




1412 


gil9484l59 


Mus musculus 


Similar to RIKEN cDNA 5430428K15 
gene 


955 


80 


1412 


AAB94842 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID iNO: loUlo. 


406 


100 


1412 


gi3875383 


Caenornabditis 
elegans 


DZVoj.Z 


10 / 




1413 


gil5080358 


Homo sapiens 


Similar to LOC88745 


1289 


98 


1413 


^ ' s~ O A 1 1 C/T 

gi6841 156 


Homo sapiens 


rlor'L/ZDJ 


Q1 9 




1413 


^.'1 /z^i /zc\ A r\r\ 

gil 6769400 


Drosoplrila 
melanogaster 


L^DZyZ /Op 




^A 


1414 


A A voocon 

AAY2o5o9 


Homo sapiens 


jjainjj 11A.-1 Dinoing protein z 
(TIABP2). 


ZODU 




1414 


AAW64717 


Homo sapiens 


D AND Human TIABP2 protein. 


2860 


95 


1414 


AAR72830 


Homo sapiens 


DAND Human TIABP2. 


2860 


95 


1415 


AAH23820 
aal 


Homo sapiens 


INCY- Human transferase HTFS-20 
cDNA, SEQ ID NO: 62. 


1252 


100 


1415 


A AU 11627 


Homo sapiens 


BOD A- Human zinc finger protein 26. 


1 O CI 


1UU 


1415 


AAB73513 


Homo sapiens 


tt"k T/~i"\ T TT j r* TTTT7CI O f\ 

INCY- Human transferase HTFS-20, 
SEQ ID NO: 20. 


1252 


1 AA 
100 


1416 


gill63174 

* 


Rattus 
norvegicus 


similar to yeast Sec6p, Swiss-Prot 
Accession Number P32844; similar to 
mammalian Jb>94, bwiss-rrot Accession 
Number Q03 169; Method: conceptual 
nansiation suppnea oy autnor 


3oo 


ZD 


1416 


gil /9j J 1 


rlomo sapiens 


ijy4 protein 






1 A 1 C 

1416 


A A "\Z C 1 1 1 C 

A A Y J 1 1 1 j 


rlomo sapiens 


liNv^ i - numan rioriv^/O protein. 






1417 


gil 04404 16 


Homo sapiens 


FLJ00043 protein 


6728 


95 


1417 


'10^"/\/11 /(O 

gil3604l43 


Mus musculus 


tangerin A 




o / 


1417 


a a a if r\r\o £1 A 

AAM00864 


Homo sapiens 


rlYori- Human Done marrow protein, 

OE^ 1U jW. 


1 QA7 


QR 

70 


1418 


AAU 17059 


Homo sapiens 


HUMA- Novel signal transduction 
patnway protein, oeq iu ozh. 


2491 


99 


1418 


gil 8026290 


Homo sapiens 


Gab3 


2491 


99 


1 /l 1 o 

1418 


gil 8026288 


Mus musculus 


Cjabi 


1 QPM 


/ O 


1419 


AAU 1 7059 


Homo sapiens 


HUMA- Novel signal transduction 
patnway protem, oeq llJ 024. 


2886 


99 


1419 


gil 8026290 


Homo sapiens 


Lrabi 




QO 

yy 


1419 


gil 8026288 


Mus musculus 


Gab3 


2178 


74 


1420 


gil3528684 


Homo sapiens 


Similar to ribosomal protein S6 kinase, 
dz,L\±j, poiypepiiQc i 


Z 1 D 1 


1 AA 


1420 


AAB65612 


Homo sapiens 


SUGE- Novel protein kinase, SEQ ID 
NO: 138. 


2695 


94 


1420 


AAM41924 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 6855. 


546 


55 


1421 


gi2330003 


Gallus gallus 


glutamine rich protein 


151 


26 


1421 


gil 5292043 


Drosophila 
melanogaster 


LD36157p 


141 


23 


1421 


gil59333 


Leishmania 


glycoprotein 96-92 


132 


25 
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ID 


lilt ID 


species 


uescripiioii 


score 


identity 






major 








1 A 

14ZZ 


AAriUOOl / 


riomo sapiens 


O A A TJiimon fw/^fvaiTi 1i*a\7inrr 

o/\*JTA i Hi 11 la 11 pioicin naving 

XXy UXULJIXUUXC/ LHJIllalll, 111 lu / JU. 


Q70 


89 


1 A99 


A A*M"9^467 


XXIJIXIU odpiCllo 


T-IA^C Th _ Mnman irrntpin Qpniipnrp ST^O 

XX X OXj XT1U.XXXCIXX LXIULGXXI oC^UWlwt uljy 

ID NO:982. 


970 


89 


1 J.99 


oil S 9 09607 

gii jzyzou / 


T^mc rYrVtn 1 ci 
J_xl UbUlJXXXXci 

melanogaster 




182 


25 




A AA/TQ1zl^9 
A A1V1.7 O 4 DZ 


T-T/"\VVl/~% OQTMPtlO 

XIOIIIU adpiCIlo 


TTFT T- Riirnan r»n1vnpntide SFO TD 

X xX-jX-jX. XXLlIXXclXX puiypt^p Uldt, JJby XJ-/ 

1MO- ^124 


1284 


100 


1423 


gi32 19288 


Mus musculus 


PA28 gamma subunit 


1284 


100 


1 /I 91 
14ZJ 


rxiiK 1 9nn 


ivius muscuxus 


XVI allLlgCIl 


1 9R4 


100 

A \J\J 


1/19/1 
14Z4 


glZ I 1 U440Z 


rionio sapiens 




1 

X J \J 


1 00 


1424 


gi20805945 


Drosophila 
me l ano g a s ter 


wingful 


675 


45 


1424 


gi20269077 


Drosophila 
melanogaster 


Notum protein 


675 


45 


1 /i i c 


gll j /ZloOU 


L^iona intestinaiis 


/4i main inffSrrrtD/liQi'O in ^ 

aynem miermeaiaie cnain o 


/ Dy 


40 


1425 


gil817526 


Anthocidaris 

* * 

ciassispma 


intermediate chain 1 


735 


46 


1425 


gi75 80490 


Homo sapiens 


NM23-H8 


473 


40 


1426 


A A T TO 1 no /I 

AAUol9o4 


Homo sapiens 


1JNC Y- riuman secretea protein 
SECP10. 


434 


77 
/ / 


14zo 


glOJOU /zZ 


Yersinia 

pseudotuberculos 
is 


D-unit poiymerase-UKe protein 


R9 
oZ 


9^ 


1426 


gi6735326 


Arabidopsis 
tnaiiana 


phenylalanine-tRNA synthetase-like 
protein 


76 


30 


1427 


ABB 11972 


Homo sapiens 


HYSE- Human VM106R. 1 
homologue, SEQ ID NO:2342. 


698 


60 


1427 


gll 6648066 


Drosophila 
melanogaster 


CjrlUooiUp 


DUU 




1 /I <"> '"7 

1427 


* 1 O O /*\ /I AC 

gi3880445 


Caenorhabdms 
elegans 


contains similarity to Pfam domain: 
PF02214 (K4- channel tetramerisation 
domain^, ocore— /y.r>, n-vaiue— z.3e-zu, 
N-l 


D 1 / 


'Xl 

j I 


1428 


A A Tl r\f\ £Z"~J A 

AAJbOOo/4 


Homo sapiens 


riUiVlA- riuman pro tern tyrosine 
kinase receptor (PTK) from clone 

xlooJy £ f j. 


1 1H9 
1 ^UZ 


1UU 


14Z5 


a a iv/f ^on^o 


riomo sapiens 


xx x oJj/- xiuman poiypepuae oxj/^/ x.u 
NO 2184. 


1 1 07 
I 1U / 




1428 


gil6648322 


Drosophila 
melanogaster 


LD29155p 


514 


37 


1429 


gil2803785 


Homo sapiens 


Similar to cell cycle progression 2 
protein 


2341 


100 


1429 


AAB94081 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 14279. 


2334 


99 


1 A9Q 


ai91^9Q09 
glZ j DAy\JA 


norno sapiens 


ocxx uyoxc pXUgXCboXtJXX ^ pxuicxxx 


1 686 


94 


1430 


gi2 1040522 


Homo sapiens 


similar to kinase D-interacting 
substance of 220 kDa; ankyrin repeat- 
rich membrane-spanning protein 


1511 


99 


1430 


AAU20586 


Homo sapiens 


HUMA- Human secreted protein, Seq 
ID No 578. 


1387 


100 


1430 


AAE01031 


Homo sapiens 


HUMA- Human death domain- 
containing receptor (DDCR) protein 
from HFIHQ20 clone. 


1259 


100 
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1431 


gill 875764 


Homo sapiens 


neurogenin 2 


1301 


100 


1431 


gil 504095 


Mus musculus 


DNA-binding protein 


1146 


83 


1431 


gil666910 


Mus musculus 


neurogenin 2 


1138 


o ^ 

83 


1432 


gil9067867 


Pseudonaja 
textilis 


pseudonajatoxin b precursor 


74 


34 


1432 


gi5230714 


Pseudonaja 
textilis 


long neurotoxin precursor 


74 


34 


1432 


gi|14250573 
|gb|AAH087 
41.1|AAH08 

'"f A 1 

74 1 


Homo sapiens 


LIM protein (similar to rat protein 
kinase C-bmdmg enigma) 


67 


40 


1433 


gil881738 


Acanthamoeba 
castellami 


myosin-I binding protein Acanl25 


366 


28 


1433 


gil470l866 


Dictyostelium 
discoideum 


carmil 


325 


25 


-* A -""1 

1433 


gi20087129 


Dictyostelium 
discoideum 


DEVELOPMENTAL PROTEIN 
DG1112 


139 


23 


1434 


gil2584159 


Homo sapiens 


zinc ringer protein 268 


5261 


100 


1434 


gil4579579 


Homo sapiens 


ZNF268B 


4395 


100 


1434 


gil9851928 


Homo sapiens 


CLL-associated antigen KW-4 splice 
variant 2 


4342 


98 


1435 


ABN00011_ 
aal 


Homo sapiens 


AEOM- Human genome-derived 
myosm-like protein encoding cDNA 
SEQ ID NO:2. 


2623 


98 


1435 


ABB06333 


Homo sapiens 


AEOM- Human genome-derived 
myosin-hke protein SEQ ID NO:3. 


2623 


98 


1435 


gil5718364 


Homo sapiens 


myosin heavy chain 


2623 


98 


1436 


gi!7945712 


Drosophila 
melanogaster 


RE31178p 


360 


40 


1436 


gil8491231 


A t * 1 * 

Arabidopsis 
thaliana 


At2g41750/T11A7,15 


167 


26 


1436 


gi987652 


Escherichia coh 


hypothtycal protein m UNG 3 region 


1 16 


25 t 


1437 


AAM79289 


Homo sapiens 


HYSE- Human protein SEQ ID NO 

1 Af 1 

1951. 


470 


O A 

84 


1437 


AAG03927 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 8008. 


470 


84 


1437 


AAB57110 


Homo sapiens 


ROSE/ Human prostate cancer antigen 
protein sequence SEQ ID NO: 1688. 


470 


84 


1438 


AAW29658 


Homo sapiens 


GEMY Homo sapiens BH272_3 clone 
secreted protern. 


829 


89 


-t /loo 

1438 


gi2282038 


T T • 

Homo sapiens 


»^ 1 A 

p2l-Arc 


o o n 

829 


on 
89 


1438 


gi6983853 


Mus musculus 


Arp2/3 complex subunit p2l-Arc 


814 


87 


1439 


ABB 50 184 


Homo sapiens 


INCY- Human transcription factor 
TRFX-35. 


i /i rA 

1459 


57 


1439 


AAB93164 


Homo sapiens 


HELL Human protein sequence SEQ 
IDNO:12091. 


1425 


65 


1439 


A A TV K A /\0 AO 

AAM40308 


TT • 

Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 3453. 


1418 


C 1 

51 


1440 


AAU27656 


Homo sapiens 


ZYMO Human protein AFP548753. 


537 


99 


1440 


AAU21102 


Homo sapiens 


HUMA- Human novel foetal antigen, 
SEQ ID NO 1346. 


337 


100 


1440 


AAM75756 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 36062. 


313 


100 


1441 


gi9294050 


Arabidopsis 


protein kinase-like protein 


106 


30 
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ID 


Hit rn 

XXI t XX/ 


QriOfi at; 


x/csLI IJJliUH 


fi 

score 


X CI CCll I 

identity 






■f It 51 1 i o n a 
ixx&xx alia 








1441 


gil5983765 


Arabidopsis 

lllallalla 


AT3g24550/MOB24_8 


106 


30 




ail 1877617 

gXl JO / / O 1 / 


.rYX ctL/lU.UJJo lo 

tli a linn a 

MXC1XXCIX1.CI 


pruiein Jonase-iiKe proieiii 


1UO 




1442 


ail 1064131 


TTnTTin Qanipric 

11U1UU OulJlvllO 


1VR19QT"j4 1 9 ^nnvpl -nrntfin icnform 

UJ~> J^i<j7J_/ t T. X .Z> ^HUVCl UHJlt/lXAj loVJXWXJ.ll 

2"! 
^ ) 


40^ 


07 


1442 


gil 1064132 


Homo sapiens 


bB329D4.1.1 (novel protein, isoform 

1 1 

1 ) 


383 


89 


1442 


gil 1064130 


Homo sapiens 


bB329D4.1.3 (novel protein, isoform 


374 


98 


1443 


eil9347650 


xxunxu oauiv^iio 




1 3S6 

X J O \J 


1 00 

X 1/1/ 


1443 


gi20380760 


Homo sapiens 


BNIP-2 similar 


1386 


100 


1441 

X ' rrJ 


ail 01476^9 


xxuxixvj bcljJICilb 


RTJTP-^hf»t5» 

OXNxJr-ijUCla 


1 P1R9 


QQ 


1 444 


ai?4?916? 


Si 51 n li i m nlniim 

Ocilllcil lllll &1UUX.1X 


nrnlmp rif^Vi nvnfpi ti 

jpxuxxxxc xxl-xx pxutciii 


1 Ad 

1^4 


J X 


1444 
i iii 


ail 6772R 


T^) i r* tvr» <3 1 i n m 

discoideum 


tT*5i 115-54^^11 a fiiftnr 

IX CtXXo a\j Lillg XcHUVJX 


1 97 

x z. / 




1444 


gi4539386 


Arabidopsis 
thaliana 


extensin-like protein 


122 


27 


1445 


gil 7223626 


Homo s aniens 


ATP -bind in j? cassette A 1 0 


378 


100 


1445 


AAU18378 


Homo sapiens 


HXJMA- Human endocrine polypeptide 
SEOIDNo333 


277 


72 


1445 


gil 7223 624 


Homo sapiens 


ATP -binding cassette A9 


277 


72 


1446 

A. I I \J 


ei633990 


TVfn«s miiQcnliiQ 

IVIUj UlUovUlUO 


r 71Tir» "fin rrpr tll'ntplll 

ZjXIH XXXXg^X LJXV/LC<XXX 


1 064 


ou 


1446 


ai4481920 


1VTii<i miKfnliiQ 


KJ£jL X X JUX KJ IC111 


1 061 

X .7U X 


86 1 
ou 


1446 


gil336158 


Rattus 

11AJ1 V GglV-<lXo 


pancreas only zinc finger protein 


1954 


85 


1447 


gil3186114 


Homo sapiens 


rab interacting lysosomal protein 


1850 


92 


1447 


ai451QS60 


o ii cp luixiy ucb 

coelicolor A3 (2) 


xiiiegxai memuxane proiem wiin Kinase 
activity 


1 zlQ 

i 


ZO 


1447 


gil6648230 


Drosophila 
meianogd.i> ter 


GH23825p 


139 


22 


1448 


gil3186114 


Homo sapiens 


rab interacting lysosomal protein 


2047 


99 


1 AA% 


rr'ii4<s1Q*\£fi 
gl*+03!7Z)OU 


otreptomyces 
coeiicoxor /\Dyz. ) 


integral memorane protem witn Kinase 
activity 


1 /fO 


ZD 


1 44R 


gl70JO /OX 


F'Qr'i 1 1 ti i tti 

Col LLlGlllU-lll 


R API 0 1 ^ 

Xj/\^.- 17.1 J 




91 


1449 


A AE0 1 693 


XXVJ11XU OO-jJlt/llO 


T4T 1\A A - T-Tiiman apnp r )'~) pnpnH 

X X J. VX.T*. XXlXXXXdlX HC/XXC CXXC/UUCU 

secreted nrotein HOFNMS^ SFO TD 

O^UlLiLvU piuiuui XXV_/X X > 1 VX _> .J 5 ul-/V^ XX-/ 

NO- 105 


114 

Jit 


67 


1449 


ABB90444 


Homo sapiens 


HUMA- Human polypeptide SEQ ID 
NO 2820 


115 


53 


1449 


AAB50967 


Homo sapiens 


GETH Human PRO 14 10 protein. 


115 


53 


1450 


sil2002226 


Hnmn ^afiipTiQ 

XlUlllU JflUlvllO 


C^1> T-TP4-tvnp 7inc fin af^r nrnfpin 

VyJlXV-'*! LJf|Jt/ ZjlllV XXXXgCX ^JXDLCXXX 


204? 


00 


1450 


gil 675 1522 


Mus mus cuius 


dioxin inducible factor 3 


1966 


88 


1450 


ml 3752272 


xxUlliU odpXClla 


laiyxi^cax caxL/xiiuiiia xcidLeQ proiem I 


1 ^04 

1 D\JH 


QQ 

yy 


1451 


gil3752754 


Homo s aniens 


zinc fineer 1111 


1977 


59 


1451 


gil 04403 98 


Homo sapiens 


FLJ00032 protein 


1893 


58 


1451 


AAB95862 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 18929. 


1881 


60 


1452 


gil 1064132 


Homo sapiens 


bB329D4.1.1 (novel protein, isoform 
1) 


380 


94 


1452 


gill064131 


Homo sapiens 


bB329D4.1.2 (novel protein, isoform 
2) 


370 


97 


1452 


gil 1064130 


Homo sapiens 


bB329D4.1.3 (novel protein, isoform 


369 


98 
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3) 






1 A C 1 

1453 


AAB63251 


Homo sapiens 


HJJJ W- Human breast cancer 
associated antigen protein sequence 

bJbv^ 1JJ INU.Ol J. 




QQ 


1453 


AAB63250 


Homo sapiens 


LUDW- Human breast cancer 
associatea antigen protein sequence 

OTTO TT» XTO»A1 9 


415 


74 


1 A CI 


AAr>03z4y 


nomo sapiens 


lujj w- numan Dredsi caiiL/cr 
associated antigen protein sequence 




74 




/\/\JVl oUZU / 


norno adjjieno 


UVQF. PTiiman nrntpin SFO TD NO 

in I uXj 11 L4.1I1U.J11 jjl Vj Ut/lll OJL<V^ J-Lv 

3853. 


^233 


56 




A AA/T7Q99'} 


noino adpieilb 


"RV^sPT- Human nrntein STFO TD NO 

XI X Oil" 11 Lllllclll jjUJLClll OJ-iy J-J-^ 

1885. 


2233 


56 


1454 


gi5080758 


Homo sapiens 


BC331191 1 


2159 


52 


1 yi C C 


1 oorM /i 
glloyU44 


nomo sapiens 


zmc linger procem h-z 


JUO 




1455 


gi8886436 


Homo sapiens 


myeloid zinc finger protein 1 splice 
variant 


368 


30 


1455 


gi4567181 


Homo sapiens 


BC37295 3 


332 


26 


•t AC/* 

1456 


gi 179845 59 


Brucella 
melitensis 


rU 1 A ilvri JLJJNA-BiJNJJliNvjr 
rKU 1 rilJN 


1 QA 
1 o^f 


DO 


1456 


gil5159614 


Agrobacterium 
tumeiaciens sir. 
C58 (Cereon) 


AGR_L_2275p 


180 


37 


1 A C /C 


gii ju /ozuy 


oinorinzo Drum 
meiiioii 


n i r\J 1 nri 1 lv^/\Jo JrxvVj l nilN 


1 ifi 




I4J / 


A ARO 1 H1 a 


nomo sapiens 


TM^\^_ T-TiimoTi rnir'lp'ir* ppiH—^itiHiti<t 

X1NV_^ X XlU.llla.ll llU.L'IClU O.V-'lVX-LfllliJ.Hlg, 

protein, NuABP-22. 


1085 


50 


1457 


gi55471 


Mus musculus 


Zfp-29 


713 


49 


1 A C7 
14 J / 


A A T T1 £>(\C\£l 

AAU loUUO 


nomo sapiens 


nuivi/\- numan novel secrcieQ proiem, 


ooo 


SO 


1 A^Q 


gl 1 OO / / / OZ 


nomo sapiens 


Qimilnrtr^ r* A A/TP rf i cr\r»Tici'v/p pmpnt 
kjllllllal iu u^iivxjr icokiuiioivc ciciiiciit 

modulator 


672 


100 


It'JO 


rri 1 1 1 O/t S 9 


nomo bdpicnb 


1-»f^P Th A/T 1 alnVip nrntpin 
ll\^l\JC»lvx l aipiid pi \J ic ui 


594 


99 


1458 


gi30217 


Homo sapiens 


cAMP responsive element modulator 


589 


98 




A A "Q CAIOC 


nomo sapiens 


nuivi/v- numan pancrediiu t/dnt/cr 
antigen protein sequence SEQ ID 

NO -777 

1NU. / / / . 


7S1 




1459 


gil 8676566 


Homo sapiens 


FLJ00180 protein 


231 


28 


1 /i cn 
14DV 


gizU4o /z 13 


Homo sapiens 


l^akjj i j-iiKe protein 


9 1 Q 




1460 


AAB94838 


Homo sapiens 


HELI- Human protein sequence SEQ 
1JJ JNU:loUUo. 


2706 


100 


1460 


AAB67573 


Homo sapiens 


1JNCY- Amino acid sequence oi a 
human hydrolytic enzyme HYENZ5. 


97A1 
Z /\J 1 


QQ 

yy 


1460 


gil8605651 


Mus musculus 


oimilar to KIKliJN CJJJNA 4y3UDo4iNzZ 
gene 




R9 
OZ 


1 Af\\ 

1 t -r\J 1 


rvrvvjuj UUJ 


rxuiiiu sapiens 


Human «iprrf»tpH nrntein SEO 

ID NO: 7686. 


299 


98 


1462 


AAM00936 


Homo sapiens 


HYSE- Human bone marrow protein, 
SEQ ID NO: 412. 


1385 


58 


1462 


gi20502367 


Homo sapiens 


chaperone- AB C 1 -like 


1385 


58 


1462 


AAM93752 


Homo sapiens 


HELI- Human polypeptide, SEQ ID 
NO: 3735. 


1381 


57 


1463 


gil212953 


Homo sapiens 


3-methyl-adenine DNA glycosylase 


1470 


99 


1463 


gi20 162529 


Homo sapiens 


N-methylpurine-DNA glycosylase 


1470 


99 
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Tlpcrt*intinn 


s 

score 
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identity 


1463 


gil4336679 


Homo sapiens 


N-methylpurine-DNA 


1470 


99 


1 A &A 


AAr>_>oy iy 


Jriomo sapiens 


IvUlMj/ n.U.Illd.11 pi US laic UailUCi ail tig di 

protein sequence SEQ ID NO: 1497. 


1651 


99 


1 A &A 

1404 


m'Q^OO <AA 

gloJ /Z j44 


riomo sapiens 


3-iiiGiiiyi-aiiciiiiic ul\x\ giyc/Dsyiaoc 




99 


1464 


gi20 162529 


Homo sapiens 


N-methylpurine-DNA glycosylase 


1505 


100 


1 A /CC 


ft '101 O AT") 

giiy izu /z 


ivius muscuius 


ATI 


JO / J 


7^ 


1465 


gi 1199931 


Mus muscuius 


pre-acrosome localization protein 


3875 


73 


1 A /C C 

1465 


' 1 C A7fl COO 

gu507955o 


Homo sapiens 


oinniai to j-azacytiame lnaucea gene i 


1 

jOO 1 


00 


1466 


AAU11813 


Homo sapiens 


UYLE- Cancer and neurogenesis 
associaiea gene, variani dkj-d v i. 


3219 


100 


14oo 


a atti 1 on 
AAU 1 1 o 1 / 


rlomo sapiens 


u i L/H" v^ancer ano neurogenesis 

ct c? c i »a rl (T£»n#^ iron cinf" ^\ T? 0 ^ \/" *^ 

aSSOClalCU. gcIiC, Vallalll Jis^j V z,. 


^ 1 44 


08 
yo 


1466 


AAU11816 


Homo sapiens 


UYLE- Cancer and neurogenesis 
associaieo gene, variani jjv-o v z. 


3144 


98 


1 A 

140 / 


A atti 

AAU 1ZZU0 


XJ/ATOOr\ f-i nvM Atl 

riomo sapiens 


vjxi i n iiuiiidn rivUH-y / y puiypcpuuc 

oCLJUCll^'C. 


1400 


9Q 


1 467 


A AOn?1 78 
AAVJvZ l/o 


JTXUlllU SapiCllS 


fTP^IT WnmfiTi QprrptpH r>rotpin SP-O 
vj x jnuniaii ocwcicu ^jx w lv^ in, uij\/ 

TD NO* 6259 


270 


45 


1467 
itu / 




Tjanroic pafAta 


"nrnlinp-ripli nrotftiTi 


99 


30 


1468 


gl 1 OVJ / utoo 


AAU111L.' aauiciia 


FT T00141 nrotein 


605 

\j \j 


96 


1468 


AAG01457 


Homo sapiens 


GEST Human secreted protein, SEQ 
TD NO- 5538 


372 


100 


1468 


gil 5209365 


Xenopus laevis 


kinesin-like protein 


164 


34 


1 4oy 


rri 1 Q6766PW 

glloO /OOUo 


riomo sapiens 


rLJUUZuj piUlCUl 


1 8?9 


100 

A \J\J 


1 A 

140^ 


rri9ft1 4Q99 1 

gizui4yzz i 


xiomo sapiens 


dXOIlClIldl llCaVy L/llalll tiyiicin lypc -J 


1 550 


41 


140Z? 


rri 1 799^4R6 

gli /ZZD4oO 


jnomo Sapiens 


t/iiiaiy tiyiicixi iicavy uiiani / 


1 591 


41 


1470 


gil7425160 


Mus muscuius 


testis serine protease2 


373 


60 


1 A 7A 

14 /U 


' i en OA/1 1 ^ 

gll jJzU410 


ivlus muscuius 


serine proiease-iiKe i 


^40 


61 

D 1 


1470 


gi2 1064337 


Drosophila 
meianogasier 


RE07247p 


260 


48 


1471 


AAE19166 


Homo sapiens 


INCY- Human protease, PRTS-3 
protein. 


1327 


99 


1 a n 1 
14/1 


giyyj / /ou 


riomo sapiens 




1 ^97 


yy 


1471 


g il4484922 


Homo sapiens 


prostinogen 


1323 


99 


1 /no 
14 /z 


a A "QO/1 ^ 1 C 

AAriy4ol j 


Homo sapiens 


ririL,!- Human protein sequence oriv^ 
ID NO: 15462. 




00 


1 /I TO 

14 /z 


AAY j lo4o 


rlomo sapiens 


rrcjr/ riuman lo.i nomoiog protein 
fragment. 


1 ^8? 


1 no 

1UU 


1 /no 
14 /Z 


A A AvT/1 AO 1 /I 

AAM4U3 14 


llomo sapiens 


n Y ors- riuman poiypeptiae oxiv^ iu 

INU o4Z)y. 


^79 
J / z 


96 


1473 


gil 8606245 


Mus muscuius 


RIKENcDNA 1810020E01 gene 


712 


70 


1 /lOQ 

14/i 


A A *\/f1 Q&A 1 

AAJVL5o041 


Homo sapiens 


riUiviA- riuman colorectal cancer 
antigen SEQ ID NO: 156. 


6Q9 
OVZ 


06 
y\j 


1 /IT? 

14/3 


A A TV yTAO 1 OA 

AAMy31 /U 


Homo sapiens 


riUiviA- Human digestive system 
antigen oxiv^ iu invj. zoiy. 


oyz 


Q6 


1474 


A ATT8901 7 


riLiiiio sapiens 


TNr^V- PTmrmti QPfrpfpH "nrAtPin 
xinv^ x xxtiiiiaii acwvitu LfivjLt/in 

SECP43. 


4623 


97 


1474 


gil8676716 


Homo sapiens 


FLJ00257 protein 


4041 


99 


1474 


AAM00753 


Homo sapiens 


HYSE- Human bone marrow protein, 
SEQ ID NO: 116. 


2042 


72 


1475 


AAU82017 


Homo sapiens 


INCY- Human secreted protein 
SECP43. 


4372 


93 


1475 


gil8676716 


Homo sapiens 


FLJ00257 protein 


3769 


94 


1475 


AAB47134 


Homo sapiens 


INCY- CDIFF-15, Incyte ID No. 


1757 


48 
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3478571CD1. 






1 A 

14 / 0 


AUriyU IjJL 


xiomo sapiens 


u i ju xiuman 1 umour nnaomciidi 
Marker polypeptide SEQ ID NO 236. 


1 1 

1 1 JO 


/ o 


1476 


gi6708478 


Mus musculus 


formin-like protein 


784 


57 


1476 


gi4 101720 


Mus musculus 


lymphocyte specific formin related 
protem 


744 


60 


1477 


gil8848216 


Homo sapiens 


Similar to RIKEN cDNA 2410016C14 
gene 


732 


97 


1477 


AAG02288 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 6369. 


561 


100 


1477 


gi4234 

• 


CI — 1 

Saccharomyces 
cerevisiae 


the product of PRP22 gene acts late in 
the splicing of yeast pre-messenger 
KJN A, mediating tne release oi tne 
spncea niiviN a ironi tne spiiceosorne 


o m 

zy / 


< o 

JO 


1478 


ABB061 66 


Homo sapiens 


AJIN Human liver with hepatitis 
leiatea protein ori^ iu inu. iz. 


2688 


100 


1 A HQ 

14 /o 


A-brSUolOJ 


rlomo sapiens 


a j UN xiuman nver wiui nepauus 
related protein SEQ ID NO: 11. 


1 6^ 

1 ODD 


1 on 


1 A HQ 

14 /o 


giyooooy 1 


ivius musculus 


zinc linger proiem z / o ^zriz type 






1479 


AAU76799 


Homo sapiens 


BIOW- Human oxysterol-binding 
protein oo /z. 


1535 


99 


1479 


gi63 18855 


Homo sapiens 


A-kinase anchoring protein 220 


379 


22 


1 A HC\ 

14 /y 


gi/o / 139z 


Homo sapiens 


daz 1 jjd Lj.i (A Kinase ^kjs^a ) ancnor 
protein 11) 


d /y 


zz 


1480 


gi!3016757 


Homo sapiens 


dJ553F4.3 (novel zinc-finger protein) 


2638 


99 


1480 


A A TJfVC 1 

AAB951 /o 


Homo sapiens 


HbJLl- Human protein sequence i>.bQ 
ID NO: 17237. 


1 1 JU 


1UU 


1480 


AAU16091 


Homo sapiens 


HUMA- Human novel secreted protein, 
Seq ID 1044. 


848 


80 


1481 


AAG0043 1 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 4512. 


230 


91 


1481 


gi20429112 


Paracoccus 

zeaxanthinifacien 

s 


mevalonate diphosphate decarboxylase 


/U 


^ 1 

D 1 


1 A O 1 

14ol 


gljl /DJDOOK) 

|ref]NP_499 

UOO. 1 1 


Caenorhabditis 
elegans 


Y3 /JJoA.ZZ.p 


00 


DO 


1 A CO 
145Z 


A A1V1Z Z> y 3 D 

1 


norno sapiens 


xi i oi-/- numan protein sequence ojz,v^ 
ID NO: 1470. 




Q7 


1/101 
14oZ 


gl!43 1 /oVU 


Oraiius gaiius 


spinaiin 


z / o 




1482 


gi 17426439 


Homo sapiens 


bA445016.1 (DXF34) 


269 


96 


1 A Ol 

14bi 


AAK94yo3 


Homo sapiens 


1JNKJV1 rluman survival motor neuron 
SMN protein. 




on 


1 A O") 

1483 


A A T> C\/CC\C\ 1 

AARyoyy 1 


T T „ _ • „ 

Homo sapiens 


1JNKJV1 Survival motor neuron protem, 
aenvea rrom cione 1 -dljjj4 i . 


1 con 


OA 


14o3 


gloZ41 oo 


rlomo sapiens 


survival motor neuron 




on 


idRd 


A A CrCiJdd^ 




CVF ST Human QerreteH r>rot(=»in <sFO 

VJXjO X XLlA.llla.ll oCV^I C/lCLl jJlULCill, uX^V^ 

ID NO: 6524. 


304 


100 


1484 


AAY50923 


Homo sapiens 


ALPH- Human fetal brain cDNA clone 
vclO 1 derived protein. 


68 


48 


1484 


AAY09001 


Homo sapiens 


USSH FMF associated protein pyrin 
MFEV. 


66 


40 


1485 


gil6755594 


Bos taurus 


mucin 


110 


30 


1485 


gil 8495795 


Bos taurus 


MUC1 protein 


110 


30 


1485 


AAM41866 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 


108 


30 
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NO 6797. 






1486 


AAW19919 


Homo sapiens 


REGC Human Ksr-1 (kinase 
suppressor of Ras). 


294 


81 


1486 


gill71250 


Mus musculus 


protein kinase related to Raf protein 
kinases; Method: conceptual translation 
supplied by author 


277 


78 


1486 


AAM25335 


Homo sapiens 


HYSE- Human protein sequence SEQ 
ID NO:850. 


221 


76 


1487 


AAM69345 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 29651. 


3728 


100 


1487 


AAM56960 


Homo sapiens 


MOLE- Human brain expressed single 
exon probe encoded protein SEQ ID 
NO: 29065. 


3728 


100 


1 A O 'H 

1487 


AAM42395 


Homo sapiens 


HUMA- Human polypeptide SEQ ID 
NO 128. 


599 


87 


1488 


AAM49117 


Homo sapiens 


CHUG- Human Spl family 
transcription factor h285. 


2174 


100 


"\ AOQ 

14oo 


gl9454416 


\A~,-.„ _ 1,,,— 

Mus musculus 


zmc tmger protein Sp5 


2122 


97 


1 A O O 

14ob 


gil797o563 


Xenopus laevis 


Spl -like zinc-linger protein XSPR-1 


1306 


65 


1489 


gi4323152 


Mus musculus 


Ets-protein Spi-C 


876 


67 


1489 


gi8745414 


Aulonocara 
nansbaenscni 


Spi-C transcription factor 


467 


49 


1489 


gil 1245502 


Raja eglanteria 


SpiC 


437 


42 


1490 


AAG01095 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO: 5176. 


261 


70 


1490 


gi7644350 


Homo sapiens 


golgi matrix protein GM130 


192 


69 


1490 


gi8099669 


TT T * 

Homo sapiens 


golgm-like protein 


178 


66 


1491 


gi|20903495 
|ref|XP 128 
165.1| 


Mus musculus 


RIKENcDNA 1700030F18 


298 


57 


1491 


gi|13541067 
|reflNP_110 

''ICC 1 1 

755.1| 


Tnermoplasma 
volcanium 


1 * j 1 TT*V^R T A 1 * 1 • j* 

Predicted DNA-brndmg protein 


70 


27 


1492 


AAM94393 


Homo sapiens 


HUMA- Human reproductive system 
related antigen SEQ ID NO: 3051. 


367 


95 


1492 


gil7861938 


Drosophila 
melanogaster 


HL03404p 


238 


43 


1492 


gil7862896 


Drosophila 
melanogaster 


SD02338p 


238 


43 


1493 


gil9527867 


Drosophila 
melanogaster 


AT11276p 


257 


26 


1493 


AAB94796 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 15921. 


252 


26 


1493 


AAM39450 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2595. 


252 


26 


1494 


AAB93521 


T T * 

Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 12861. 


581 


41 


1494 


AAM90443 


Homo sapiens 


HUMA- Human 

immune/haematopoietic antigen SEQ 
ID NO: 18036. 


237 


97 


1494 


AAM70186 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 30492. 


166 


46 


1495 


AAB92790 


Homo sapiens 


HELI- Human protein sequence SEQ 


1709 


68 
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1U JNU. 1 IzoO. 






1495 


AAM87063 


Homo sapiens 


HUMA- Human 

immune/ naematopoietic antigen oxiv^ 
ID NO: 14656. 


174 


72 


1 AOS. 


a aivi o o y e> y 

• 


riomo sapiens 


TTT TTV X A Onwn« 

riUiviA- Human 

lmniune/naematopoieuc antigen oiiv^ 

TD "MO- 1 zlSR9 


1 /CO 




1 4Qfi 


cri?0S 1 09RR 


HnmA CQTMPTIO 

XxUillU oapiCXlo 


oiiiiiidi to undj ^rispHU ^ nomoiog, 
subfamily B, member 8 


1 9^£ 
1Z JO 


1 wu 


1406 
i *■+ y\j 


A AT T1 7SQQ 


XXUlilU bcipiCIlt) 


xi (j 1VJ./-V- iNovci signal uansQUCiion 

u/ciiiivvciy jji u ctnij ocu XX.J x j.vj*-t. 


1 979 




1496 


AAM95431 


Homo sapiens 


HUMA- Human reproductive system 
related antiaen SFO TD NO- 40S0 


1232 


99 


1497 


gi8777659 


Homo sapiens 


transcription factor HOXD1 1 


1576 


89 


1497 


pi871428 


A/fllQ Till IQPII 111 Q 




1 


ou 


1497 


gi397509 


Mus musculus 


HOXD-1 1 


1485 


86 


14QR 


ai7R7Qfi50 


griseus 


picOLin 


1 97 
1Z / 


9H 
ZU 


lt70 


ml 9QA669 

gllZ^OOUZ 


xXwlilU adpiCIlo 


pieuim 


1 9 S 


9 1 
Z 1 


1498 


gil561642 


Rattus 

• 

nui vegicub 


plectin 


119 


20 


1499 


AAM79656 


Homo sapiens 


HYSE- Human protein SEQ ID NO 

jjUZ. 


857 


81 


1499 


AAM78672 


Homo sapiens 


HYSE- Human protein SEQ ID NO 

1 'X'XA 


857 


81 


14QQ 


A RR 1 979Q 


xiomo sapiens 


n i kbri- Jtiuman r 1 as-ngana associated 
factor 3 homologue, SEQ ID NO:2699. 


OJ / 


0 1 

ol 


1 JUW 


gllZOZ^+.jLO 


/\raoiQopsis 

til Q 1 1 q n Q 

Ulalldiia. 


Aioi-iiKe protein, 41 Idd-^zddd 


1 oo 


z / 


1500 


gil2324510 


Arabidopsis 

Lllalldlld 


AIGl-like protein; 1 1637-17773 


183 


28 


1500 


gi4097585 


Nicotiana 

LctL/ct^tlill 


NTGP4 


160 


26 


1501 


gil932775 


Mus musculus 


paired-type homeobox gene 


276 


35 


1 SOI 

t Jul 


ail cns44fn 


A^llO T>"n 1 lilt c 


reima ana anterior neural ioia 
homeobox 


z /o 




1501 


gi2240024 


Mus musculus 


retinal homeobox protein 


271 


42 


1 509 


A ATTR07R7 


nomo sapiens 


tUKi- riuman lung tumour protem 
from clone L587S. 


13zU 


1 AA 

1UU 


1502 


gi 1604 1678 


Homo sapiens 


Similar to RIKEN cDNA 2810433K01 
gene 


1320 


100 


1502 


AAU80388 


Homo sapiens 


CORI- Human lung tumour protein 
from recombinant L587S. 


1315 


100 


1 jlo 


A A T70 1 ^11 


Homo sapiens 


HUMA- Human gene 15 encoded 
secreted protein fragment, SEQ ID 

JNU: lo4. 


3358 


100 


1503 


AAY21842 


X.X\JXXX\J OClLFXVZ/llO 


TT^Tf^^V— WnmnTi cional r\oiAfi r\ a 
XINV^ X XXUllla.il ilgllcll peptide— 

contianing protein (SIGP) (clone ID 
1273453). 




i no 

1UU 


1503 


AAE01494 

• 


Homo sapiens 


HUMA- Human gene 15 encoded 
secreted protein HBKED12, SEQ ID 
NO:150. 


879 


99 


1504 


AAB95183 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 17252. 


1492 


99 


1504 


AAG01491 


Homo sapiens 


GEST Human secreted protein, SEQ 


752 


99 
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SEQ 
TD 


Hit ID 


Species 


Description 


S 

iSVUl c 


Percent 

iHpn t\txr 

1 UCll Ll 








ID NO: 5572. 






1 S04 

JL Jut 


Qiy-ryyy \J 


iVallU-a op. 


ZilliL' llxlgCl piULCUl iv 1 f—i 


1 04 

JL UH 


9S 


1 SOS 




T \/f"< : >r'Vi inno 
i-^y LCOIllllUo 

variegatus 


IIUUXUlUUUlC^cloaiJL/laLCLl piVJLClll 


01 

y l 


9^ 


1 SOS 


ai 109*,^ 501 


JLxUilxU aapiciib 


Similar fn RTTf FXf rDT^TA 1 70001 QT700 

gene 


R4 


9^ 


1505 


gi3 132790 


Crithidia 
fasciculata 


CACK protein 


79 


26 


1 


A A"RQ^ZLOQ 
/Y/YJD y D *r KJ y 


nomo sapiens 


rirsi^i- riuman proiem sequence oxiv^ 
ID NO: 17793. 






1 <\OA 

13UO 




cams iaminaris 


ueacie 






1506 


gi20151453 


Drosophila 
meianogaster 


GH26759p 


77 


26 


1 ^07 


rri 1 1 1 Qf\AQ1 

gii i lyu^io 


riomo sapiens 


uJ\j £ tDL>ZD.z ^novei protein witn 

o 1 13/ ruA ^uroaa c-ompiex i ramuracK 

a UIuW 1 VJA. Vliuo CillU ZjIIIC IJJ-Jlgd 1 

domain) 


DD 1 / 


1 nr\ 


1507 




Hnmn Qanipnc 

XXVJlllW ouUltllo 


oiiiuidi l\J ±x.cioii pujic-iiioj oiiiiiidi LvJ 

BAA77027 TPID -24650844^ 


y / d 


^4 


1507 


AAU28187 


Homo saniens 


"HfYSP.- l^Jnvf*1 Viiirnan ^ftnrptorv nTAtpiTi 
Seq ID No 356. 


948 




1508 


eil478188 


Oallus ffallii*? 

V — 1 CilXLiu ClUll LXk_f 


CHoxE 

V^X 1V/A1.7 




IS 


1508 


ei6594621 


Danio rerio 


homeoHox nrotftin 


315 




1508 


ei7689275 


XenoDiis laevis 


hoiTieodotTiairi nrotein Hhx 

il\/illVv/Vi\JlllU.i.Xi L/X V-/ lV-111 V_XI_//V 


311 

_/ X X 


47 


1509 


AAG023 66 


Homo sapiens 


GEST Human secreted protein, SEQ 
ID NO- 6447 


391 


98 


1509 


gi 164 18290 


Homo sapiens 


RPGR-interacting protein 1 


146 


42 


1509 


ffi9966409 


Roc tfinTiic 


T?P^tT? -inf'Praptincr tirntpin- 1 

XvJT VJ XV 111 LCI a\s till g pi \J LC111 1 


1 07 
i \j i 


49 


1510 


gi9884738 


Homo sapiens 


dJ983H21.2 (A novel protein similar to 
AP-2 beta transcription factor) 


1151 


100 


1510 

1 Jlu 




TV/Tnc miic^nliio 
IVlU-o UlUoL/ Ulut> 


LianburipLioii idcior /\p-z oeta 


1 o^£ 

1 UjO 


77 
/ / 


1510 


gil495417 


Homo sapiens 


AP-2 beta 


1033 


76 


1511 


AAB95825 


Homo sapiens 


HELI- Human protein sequence SEQ 

1JJ INvJ. J.oo34. 


1429 


100 


1511 


gi2605967 


Equine 
Herpesvirus 4 


24 


156 


25 


1511 


gi2 1324755 


Corynebacterium 
giuiamicum 
ATCC 13032 


Translation initiation factor 2 (GTPase) 


146 


26 


1512 


AAB75305 


Homo sapiens 


ROSE/ Human secreted protein 
sequence encoaea by gene 1 j o±iv^ 1JJ 
NO:124. 


2785 


98 


1 SI 9 


gl 1? JZoZ JJ 


u ro s op mi a 
meianogaster 


Ulll JJOJp 




JO 


1^19 
ID LZ 


criQR7997 
giyo /ZZ / 


L^aenornaDQitis 
elegans 


iNucampnonn 


1 oo 


40 


1513 


ABB05592 


Hnmn wnipn? 


' 1 l^j^TL 1V111.VJL1^/ L/C11U UII1C1 C — dooUL/lclLCU. 

kinesin NYD-KIF2 related fusion 
protein. 


^S09 


yy 


1513 


ABB05589 


Homo sapiens 


UYNA- Human testis-specific protein 
1 NYD-TSP1 related fusion protein. 


3802 


99 


1513 


AAU 10784 


Homo sapiens 


UYNA- Human ubiquitin-like fusing 
protein (UFLP). 


3802 


99 


1514 


gil6755530 


Homo sapiens 


androgen receptor-associated 
coregulator 267-a 


12441 


99 
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ID 


-till XAJ 


opecies 


uescripiion 


c 

score 


x^ercent 
identity 


1 S 1 4 


rri 1 1 96Q 


ri urno Sapiens 


anarogen receptor associatea 
coregulator 267-b 


19441 


QQ 

yy 


1 S14 


ai 1 S? 1 ^S49 


rioino bapicns 


IN ioLJ 1 


I 1 Q 1 Q 

I I o i y 


07 

y / 




A ARQ4P,°,1 


JTIOIIIO bapiCIlb 


rii-<i_/i*" riuman proiem sequence oxiv^/ 
ID NO:16098. 


1 AC\A 


QQ 

yy 


ISIS 


/\^VI3 2? Z. \J D ZJ 


numo bapiciis 


nij/i_/i- numan proiem sequence oJqv^ 
ID NO:10961. 


QA 

y4 


7A 


1 -S 1 5 


A A frP»Q1 09 


noirio bcipicnb 


vjjj/o i numan se ere tea proiein, oxiv^ 
ID NO: 312. 


y4 


^A 






X^lllctlllU C U a 

histolytica 


uiapnanous proiein 


79 
/Z 


9Q 

zy 


1516 


gi3421373 


Mus musculus 


28 kDa cis-Golgi SNARE 


71 


25 


1J1U 


ai149Sn9^Q 


1Y1US JX1US CUIUS 


goigi oiN/vr receptor complex memoer 
1 


7 1 
/ 1 


Zj 


1^17 


gll J^Z^'f O J 


iriomo Sapiens 


oimiiar to k i k pjn cuin/v z fyj5U4zyAUo 
gene 


O 1 A A 

Z144 


1 An 


1517 


AAB94367 


Homo sapiens 


HELI- Human protein sequence SEQ 

rr* xt/^\. 1 a on 1 
lU JN<J: 14yUl. 


840 


99 


1517 


AAM82536 


Homo sapiens 


HUMA- Human 

immune/haematopoietic antigen SEQ 
ID NO: 10129. 


433 


85 


ID 15 


A A *DQ^7 1 7 


xiomo sapiens 


xliil^l- riuman protein sequence SbQ 
ID NO: 17333. 


oy 


24 


1 J I o 


glZLO 1 jU4 1 


i nermoanaeiooa 
cter 

LCIlgCUll^eilblb 


nucleoside-diphosphate-sugar 
pyrophosphorylase 


O /Z 

GO 


27 


U J. o 




KJI yL-lOldgUo 

cuniculus 


neuroniameni pro rem ivi 


OJ 


77 
Z / 


1519 


AAB94133 


Homo sapiens 


HELI- Human protein sequence SEQ 
rr> MA- 1 znoi 


3217 


99 


1519 


AAG81359 


Homo sapiens 


ZYMO Human AFP protein sequence 

OXZ/V^/ 1JJ IN \-J .Z,D\J , 


1874 


98 


1519 


gil7944262 


Drosopliila 
melanogaster 


LD30968p 


901 


30 


1 S90 


ail 0440^77 


nomo sapiens 


T7T TAAAO/1 t-*f/-vt-<a-;-M 

r L,j\j\j\JZHr protein 


oyj 


7 1 
/ 1 


1520 


AAB64372 


Homo sapiens 


INCY- Amino acid sequence of human 
intracellular signalling molecule 

TVfTP A A 
UN llvA.4 . 


83 


33 


1520 


AAG74084 


Homo sapiens 


HUMA- Human colon cancer antigen 
protein SEQ ID NO:4848. 


83 


33 


1 ^7 1 


A A A/f77/l^£ 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID JNU: 5 1 fbl. 


462 


100 


1521 


AAM64678 


Homo sapiens 


MOLE- Human brain expressed single 
exon probe encoded pro tern SEQ ID 
NO: 36783. 


462 


100 


1521 


AAM68208 


Hnmn sanipTis 


1V1W1jJ_j AXLiiiiail OOI1C illallUW 

expressed probe encoded protein SEQ 
ID NO: 28514. 


44R 


1 nn 

1 Uu 


1522 


AAM25562 


Homo sapiens 


HYSE- Human protein sequence SEQ 
ID NO: 1077. 


1299 


96 


1522 


AAY73335 


Homo sapiens 


INCY- HTRM clone 1850120 protein 
sequence. 


1294 


95 


1522 


gil817526 


Anthocidaris 
crassispina 


intermediate chain 1 


142 


30 
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ID 


fill Al-J 






s 

score 


identity 


1 J/J 


cri 1 ^ 1 07HQ 
gllO lu /Ui? 


IvdllUS 

norvegicus 


OPQ 1 

SCal 


667 


29 


1 ^9^ 
1 DZo 


0-1*980849'* 

glZoUo^fZj 


xiomo sapiens 


diaXin-i ^spmoceieorciiax ataxia type i 

YWC\\f* XT\\ 
Ul vJlCli.1 ^ 




9Q 


1523 


AAY33494 


Homo sapiens 


BURN- Human SCA1 protein. 


658 


29 


1 ^9zi 

1 DZ4 


a a nnin^A 

/\j^LxU4UOO 


xiomo sapiens 


orno i xiuman secretea proiem, oxiv^ 
TD T\TO* R147 

X.XJ Ox^t/. 


H 1 r 


07 


1524 


gil6185010 


Drosophila 

IIlCIaIlvJgd.b LCI 


LD32609p 


72 


37 


1525 


gil 99 19744 


Homo sapiens 


hepatocellular carcinoma-associated 

nrntf* in HP A 7 
piULClXl Xiv__^_f-\_,j 


1362 


100 


1525 


gil2659148 


Mus musculus 


mage-e2 


656 


88 


1 ^9^ 
1 JZ3 


cm 1 QQ 1 Q7zlO 

giiyy iy /4u 


nomo sapiens 


nep a to c e i luiar u axo inu iiia- as s o c ia ic u 
protein HC A l 


S90 


o 


1 JZO 


grHOjOZUD 


z^ea mays 


cyiociirorne dj reciuuLdbc 


971 


^1 


1526 


gil4536592 


Physcomitrella 

pel IC lib 


PP001069030R 


270 


31 


1526 


gil4536588 


Physcomitrella 

paiciio 


2 5 jpprot 1 _046_e0 1 


263 


36 


1527 


AAU82007 


Homo sapiens 


INCY- Human secreted protein 


1213 


47 


1527 


AAM39715 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 


1213 


47 


1 ^97 


A A TV/T41 ^01 


nomo sapiens 


xi x oXi- xiuiiidii poiypepiioe oxiv^ l-L/ 
NO 6432. 


1 1 0^ 

1 1 "J 


48 
to 


13Zo 


a atts9007 

AAUoZUU / 


xiomo sapiens 


xinv^ x - xiuman secreieu proiem 
SECP33. 


1 907 


4K 


1 jZo 




nomo sapiens 


xi i oxi- xiuman poiypepnu.e oxiv^ ix^ 
NO 2860. 


1 907 
1 zu / 


4R 
*to 


1528 


AAM41501 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 


1187 


48 


1529 


gi3253120 


Homo sapiens 


R31449 3 


2305 


99 


1529 


gi7673675 


Drosophila 
meianogaster 


cactin 


1038 


42 


1529 


gi21464388 


Drosophila 
meianogaster 


RE14858p 


839 


45 




AAMooiz 1 


Homo sapiens 


MUJL-b- xiuman bone marrow 
expresseo prooe encoaea protein oxiv^ 
ID NO: 26627. 


40 / 


/o 


1 




xiomo sapiens 


iviux^xi- xiuman orain expresseu smgie 
exon probe encoded protein SEQ ID 


40 / 


7/^ 


IjjU 


A AM7S1 61 


xiomo sapiens 


iviwx-cXi- xiuman Done riiaxrow 

CApiCoaCU. piUUC CIluLILlCU. piL/lCXll OXiV^ 

TD NO- ^467 


^ 1 4 

Jit 


7R 
/ o 


1531 


gi8346962 


Mus musculus 


1A13 protein 


1990 


97 


1531 


AAB93285 


TTnmo Qamprm 


H"RT.T- TTi i m a n "nrotein seanence SEO 

ID NO: 12336. 


1182 


59 1 


1531 


AAM79275 


Homo sapiens 


HYSE- Human protein SEQ ID NO 
1937. 


1182 


59 


1532 


gi2252814 


Mus musculus 


FOG 


3629 


69 


1532 


gi7595837 


Xenopus laevis 


Friend of GATA 


1847 


43 


1532 


gi4927696 


Mus musculus 


zinc-finger factor FOG2 


949 


31 


1533 


gi4079709 


Rattus 
norvegicus 


reggiel-1 


2106 


99 
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SEO 
ID 


Hit TD 
mi ii» 






c 


X ci tClll 
lvlCllLlL V 


1533 


gi4079711 


Rattus 
norvefficus 


reggiel-2 


2063 


96 


1533 


eil3277550 


Homo saniens 


Similar to flotillin 0 


1 862 

1 ouz. 


99 


1534 


gil 61 18243 


Homo sapiens 


ARAP1 


6340 


100 


1534 


ABBO7500 




TTSTPV- Human •TrTP-hinHiriCT nrotein 

il^w X XXlXllldXl VJ X 1 UlllLilllH, LJ1 \J IC1I1 

CGTPB i flD- 1299273CDli 


69R7 


Q9 


1534 


dl5625574 


Homo saniens 


pftntanvin Hplta9 

vvli LCI. 141 111 IX w 11X1 A. 


J'w' X 


1 00 

X WW 


1535 


gil 61 18243 


Homo sapiens 


ARAP1 


6255 


99 


1535 


ARR07500 

JTiJJJJU / JUU 


T-Tf^tnn cam'pnc 
xxwinw oopiciio 


xi>ji^ i - xxu.iij.ctii vj i jr-Dxnu.ing proiem 
(GTPB) (ID: 1299273CD1). 


ozuz 


OS 


1535 


gil5625574 


Homo sapiens 


centaurin delta2 


5866 


99 


1 J JU 


A ATT1 71 


xxwiiiw odpiciio 


xiiJivi/\- iNovei signal iransaucuon 
painway protein, oeq ijl' / iu. 




1 no 


1536 


AAM48374 


Homo sapiens 


SHAN- Human PP905 protein. 


157 


41 


1J JU 




xxwjluw octpicnb 


x x y\jD 


ID / 


A 1 

41 


1537 


gi20988071 


Mus musculus 


Similar to RIKEN cDNA 260001 1E07 
gene 


156 


47 


1537 


gi6958206 


Pneumocystis 

• * • f* 

cannu i. sp. 
muris 


kexin-like protease KEX1 


137 


30 


1 S^7 


tri 1 S01 7S^R 

gl 1 J7l / JJO 


IJ-OHIU bctpiCIlb 


iM vjo o/oya 


1 1 1 
i j i 


zo 


1538 


gil 7946424 


Drosophila 

illclallwgab LCI 


R£67445p 


241 


42 


1538 


gi2 1464585 


Arabidopsis 

f"ll 5* 1 1 CI n Q 

Ilia 11 alia 


AT3g02910/F13E7_14 


181 


42 


1538 


gi552255 


Lytecliinus 

LJll/lUo 


troponin C 


146 


51 


1539 


ABB 11447 


Homo sapiens 


HYSE- Human secreted protein 
nomoiogue, oxiv^ xju* inu.ioi /. 


1764 


100 


1539 


ABB89981 


Homo sapiens 


HUMA- Human polypeptide SEQ ID 


1634 


100 


1539 


AAY41528 


Homo sapiens 


HUMA- Fragment of human secreted 
proiem encoaea oy gene / / . 


1172 


99 


1540 


AAB95462 


Homo sapiens 


HELI- Human protein sequence SEQ 
m MH' 1 no a a 


1488 


100 


1540 


AAB94494 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 15186. 


1246 


99 


1 S40 


rV/WJD'+j yZ 


jiomo sapiens 


Jtsiou- Human reducing agent ana 
tunicamycin-responsive protein 40. 


1246 


on 






Mycoplasma 
pulmonis 


f AC Tl T"F> /"~Y C /TAT T»"P» /^TTTM TIT 

50b R1BOSOMAL PROTEIN L35 


229 


74 


1 DHr 1 


rri 1 <JO 

gil jU 1 jU 


Mycoplasma 
fermentans 


ribosomal protein L35 


213 


69 


1 ^A 1 


cri9 1 9H/17Q9 

giz izu4 /yz 


otapnyiococcus 
aureus subsp. 
aureus iviwz 


5 OS ribosomal protein L35 


175 


c o 


1542 


eil3543686 


Homo ^aniens 


Similar to PfPCFN rTTNTA 

Olllllldl LVJ XvXXVX-rl>) \-J 1 > A *-rI7 J 1 'rZ.Oi VJZ, 

gene 


7^0 


97 


1542 


AAY07081 


Homo sapiens 


LUDW- Renal cancer associated 
antigen precursor sequence. 


261 


38 


1542 


AAE13761 


Homo sapiens 


CORI- Human lung tumour-specific 
protein LT86-14. 


226 


35 


1543 


gil9683951 


Homo sapiens 


Similar to DNA segment, Chr 6, 
Miriam Meisler 5, expressed 


2234 


99 


1543 


gi4008004 


Mus musculus 


D6MM5E protein 


1651 


74 
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ID 


Hit TT) 




u cscripuon 


Q 

score 


irercent 
identity 


1 54^ 


gjlTUu / 700 


A/Tito mncfiiliic 
IVJLUo limb L> 111 Uo 


uoiviivUG protein 


i 1 

1 OD I 


7 A 


1544 


gil 1493263 


Homo sapiens 


bA196N14.1 (novel protein) 


749 


100 


1 ^AA 


rri 1 £^99GQ 

gl 1 OjZZyy 


o ynec no c y s n s 
sp. PCC 6803 


JJINA ligase 


I 1 


32 


1 S44 


aifkAQO'Xfifi 
glOHi/Z jOO 


ivjLus mus cuius 




/z 


Jo 


1545 


gil3938461 


Homo sapiens 


Similar to RIKEN cDNA 493342 8D01 
gene 


938 


100 


1 54S 


A AM79S04 


inuniu bcipiCllb 


xi 1 on- xiuman proiem oriv^ lu lskj 
^ 1 50 


on 


9 Q 

Zo 


1545 


AAM78520 


Hnmn Qjini^nc 


Human nrntpin TTj MO 

1 1 82 


on 


9R 
Zo 


1546 


AAB 94901 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO- 16290 


813 


58 


1546 


gil3122230 


Leishmania 

maior 


CDC27/NUC2-related protein 


101 


28 


1546 


gi468012 


Bos taurus 


PKR inhibitor P58 


100 


26 


1 547 


oil SS6S414 


XH/II1U odpiCllb 


xj jtvo i - Diiicimg proiem ci^u/ l 


-?zy i 




1547 


gil 1527189 


Homo sapiens 


p250R 


2021 


84 


1 S47 


glO'+Oi'O i. / 


liOiiio sapiens 


o w i-oiN-T complex pro tern pz / u 




4 / 


1548 


ABB06069 


Homo sapiens 


COMP- Human NS protein sequence 


1342 


95 


1548 


AAG89296 


Homo sapiens 


GEST Human secreted protein, SEQ 
JJL> JNU. 410. 


1342 


95 


1548 


AAG89265 

» 


Homo sapiens 


GEST Human secreted protein, SEQ 

TT\ XTO . IOC 


1330 


95 


1549 


gi9837427 


Lytechinus 
variegatus 


embryonic blastocoelar extracellular 
matrix protein precursor 


259 


29 


1549 


gil5291905 


Drosophila 
meianogaster 


LD31354p 


152 


29 


1 SdQ 


gl 1 UUZoUo 


j^ytecninus 

ValiegdlUS 


extracellular matrix protein 


139 


25 


1550 


gi7288419 


Treponema 

palllU-UlIl 


tpr protein K 


68 


48 


1550 


gi|l 1095200 

Ip-HIA AO907 

72.1|AF227 
220 1 


Treponema 

palllQUITl 


TprK 


66 


45 


1552 


gi35 10757 


Homo sapiens 


glutathione transferase zeta 1 


848 


80 


1 


cri741 7477 


xiomo sapiens 


CVT7 1 
VJ lZil 




OA 




ml 01 


nomo sapiens 


giutatmone transierase zeta l 
^maieyiacetoacetate isomerase ) 


Q AQ 


oil 


1553 


AA"R94701 


XXUIT1U Sapiens 


niii--i"- xiuman protem sequence oxi^ 

XJU* 1NU.1JDOO, 


OJO 


1 AA 
1UU 


1553 


AAM79264 


Homo sapiens 


HYSE- Human protein SEQ ID NO 

1 096 

1 


171 


22 


1553 


gi7839650 


Cricetulus 
griseus 


p lectin 


111 


22 


1554 


gi9971158 


Homo sapiens 


GTP-binding like protein 2 


1286 


99 


1554 


gil 356 1007 


Homo sapiens 


bA22I24.2.1 (GTP binding protein 2) 


1286 


99 


1554 


gi20306212 


Homo sapiens 


GTP binding protein 2 


1285 


99 


1555 


AAG93258 


Homo sapiens 


NISC- Human protein HP 10582. 


1348 


100 


1555 


AAG7463 1 


Homo sapiens 


HUMA- Human colon cancer antigen 
protein SEQ ID NO:5395. 


869 


95 


1555 


gil7863052 


Drosophila 
meianogaster 


SD10403p 


350 


35 
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1556 


AAY92330 


Homo sapiens 


CURA- Human NIK 1 protein. 


1918 


87 


1556 


gi507875 


Homo sapiens 


NIMA-like protein kinase 1 


1918 


87 


1556 


gi479171 


Homo sapiens 


protein kinase 


1918 


87 


1557 


AAY92330 


Homo saniens 


CURA- Human NIK 1 nrotein 


1943 


89 


1557 


gi507875 


Homo sapiens 


NIMA-like protein kinase 1 


1943 


89 


1557 


ei479171 


Homo sanip/n<; 


nTofpin IrinaQP 

IJi \J XIX XVXX XCl O t 


1 94^ 




1558 


gil3469818 


Mus musculus 


bicaudal-C 


2179 


89 


1 JJU 


H78001 80 


yVC/llL>|J tla XdCVli 


UxUdlKIdx~V> 


1 6Q7 

i oy i 


7^ 
ID 


1558 

X *J *J o 


ail ^599175 


T PicTrmarn n 
X^GXolxXlXdXixd 

major 


PPg^ 


IjO 


9H 
ill 


1559 


AAB95608 


Homo sapiens 


HELI- Human protein sequence SEQ 

Xx_> 1NVJ. IOjUU. 


1976 


99 


1559 


AAB94407 


Homo sapiens 


HELI- Human protein sequence SEQ 

TD >Jf> 1 4QRQ 


1847 


98 


1559 


ABB04606 


Homo sapiens 


BOD A- Human RNA recombining 

arpfl 1 8 rvrntpin 9FO TD "MO -9 


849 


100 


1560 


gil5823651 


Homo sapiens 


ALS2CR1 1 


1254 


82 


1 560 

X J uu 


A AM57449 


XXUIxltJ odJJxCLxo 


lvlkj l^li,- rxLiiTiaii Didin expresseo smgie 
exon probe encoded protein SEQ ID 
NO: 29547. 




OQ 
OO 


1 560 


Ail V OJ 1 Jt 

aal 


XTxUlXltJ odpiCllo 


vjniYi i Jroiynucieonae cione quh-iu 3 
encoding a secreted protein. 


oU 


ZD 


1 561 


en* 5 869804 


J-^X IJoUJJXxxxd 

m ft 1 a n o q a <? tpr 


CxallipeU piULtJlIl 


9C7 
Zo / 


ZD 


1561 


gil5291845 


Drosopnila 

mpla nocra<;tpr 

XlXt/XClXXUgClO IV/ 1 


LD29481p 


287 


25 


1561 


ei557822 


S5appTiaroTYivpp«i 

cerevisiae 


malS qtal Iptt 1^^7 PAT* 0 ^ 

AMYH YEAST P08640 

GT T ICO AMY! ASF S1 (VC 1 ? 1 ^ 




90 
zw 


1562 


AAB95707 


Homo sapiens 


HELI- Human protein sequence SEQ 

TDTSTO-IRSSO 

XJ-/ nVJ.lOJJu^ 


896 


100 


1562 


eil 5026974 


XX\JXXX\J od|Jl(/XXo 


UL'aL/LXX XXX 


AAQ 




1562 


AAU25456 


Homo sapiens 


INCY- Human mddt protein from 


149 


36 


1563 


AAU 11805 


Homo sapiens 


MDSP- Human GRF2 protein. 


6402 


100 


1563 


ei5882990 


XXU111U odJJlClXb 


xvdb gudiiiiic nucieouue cxcnaiige 
faptnT 9 




1 oo 


1563 


AAM39825 


1-Tomo <5aTiipn<s 


HYSF- Human nnlvnpTitidp WD TT) 
xx x l)Xv xxLixiidii uui y ucliliuc oxj»\j ix-/ 

NO 2970 






1564 


AAU 11805 


Homo sapiens 


MDSP- Human GRF2 protein. 


6412 


100 


1564 


ei5882290 


T~Tp«Tvir> Qanipnc 


1-? ja c cxi laninp niir" 1 t^n+i rl/^ PYplionop 
XvdC) gUdXXXXXC 1XU.V/1CULXU.C CXt/Ildll^C 

factor 9 


£41 9 


1 00 


1564 


AAM39825 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 2970 


6407 


99 


1565 


gi951270 


Homo sapiens 


IPP-2 


503 


79 


1565 


e i474388 


xxuxxxu odjJiCLia 


inlTiV*if"r\T 9 
xxxxixuxtux ^ 


SOI 


70 i 
/ y 


1565 


gi4704218 


Homo satoiens 


inhibitor 2 of nrotein nhosDhatase 1 


503 


79 


1566 


gi20152557 


Mus musculus 


dendritic cell and B cell derived 
chemokine 


72 


25 


1566 


gi59 11365 


Mus musculus 


macrophage-derived chemokine 


72 


25 


1566 


gi5757694 


Mus musculus 


macrophage-derived chemokine 
MDC/ABCD-1 


72 


25 


1567 


gil3676773 


Chlamydomonas 
reinliardtii 


PF6 protein 


127 


31 


1567 


AAM79809 


Homo sapiens 


HYSE- Human protein SEQ ID NO 


111 


26 
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3455. 






1 ^^7 

130 / 


A A \/f7Q CO ^ 
/\/\lVl /ooZj 


ixomo sapiens 


TjfVQF Wnman nrntfin QRO TD TSTO 
XX I oX3 - xlUIlldll piULClll uDy xxJ 

1487. 


1 1 1 
111 


26 


1 ^AQ 


A A 130^879 


riomo sapiens 


T-ITJT T Unmcin nmtpi'n cpnilPtlfP Th 
xxxIjX^x" Xllimall pUJLClll aClJUCliL/C ODy 

ID NO: 18952. 


1641 

X WT" X 


1 00 

X \J\J 


lZJOo 


rriocnn^ 1 7 
giysuu d i / 


rai 

L/ y IXJlllCgdlU V 11 Uo 

Maastricht 


r»T? 1 H9 
pivlUZ 


141 

X*T X 


32 


1 S6R 
1 J uo 


ail 1*569004 

gl xDJ OZUUH 


7sI^t\Tii1q 

1 > CjJlllld 

madagascariensis 


maior ammillatp ^mYlrnin 9-lilfp nrotpin 

llldJVJl ClllllJ U11C4.LC OJJ1U1U111 £* XXXWs VJXKJWjXXX 


138 


32 




A ATT07RRQ 


XXU111U odLylCXio 


\a7T-TFT"i T>n1 vnpntiHp (jpnnpripp for 

human hspG34a. 


963 


92 


1 *\6Q 


A AT T07RQ0 


nuinu bapiciio 


AA/T-TT-* D T^o1\mf=»'ntiH<= k Qpniipnpp for 

V V X xxlixJ x \Jxy UCUU Lie oCUUC/Xl^C X\J1 

Tinman h<mf"r'34h 

llLXllldll llo|JVJ»^*TU'. 


575 


72 


1569 


gil3603867 


Homo sapiens 


ferritin heavy polyp eptide-like 17 


575 


72 


i <;7n 


A ATT1 1 £1 7 


xiomo sapiens 


LJ X X-iX-j" V^d.llL'Cl dllU. llCUlUgCllColo 
aQ<2nr»iatprl apnp variant ST?9^V7 

ClooVJV^i.ClLt/d gC/llC', VClllCllll. ~> 1VA V . 


1 S77 

X / / 


69 


1570 


AAU11816 


Homo sapiens 


UYLE- Cancer and neurogenesis 

accrifMatpH ctptip variant ST?— ^\/9 

CtOOUVlulCU gCllCj VClllClllL J1V-J V . 


1577 


69 


1 S70 


A AT 11 1814 


XlVJlllU odUiCllb 


T T"VT T-^-. Pnnppf anH nf*nrocrf*TiPQi^ 

X X/1j V^-ClllV^V^l C1.11VX ll^/Wi\J ^V-'llV^CDlO 

a^^opiated ?ene variant "5G-3V2 


1577 


69 


1571 


gil 8204 103 


Mus musculus 


RIKEN cDNA 4930429H24 gene 


531 


38 


1 S71 


A xvl VI _> O ^ -J VJ 


ihjiiiu 5d|_»iciio 


HVSF- Human nnlvnentide SFO TD 

±± X Ulj X X\JLXXX<XXX yJKJXy LJ\slJl.X\X\~/ U JL-/ V^/ 11^ 

NO 2101. 


496 


31 


1 S71 


01*66441 76 


T-Fr\Tnr\ com f^n c 
XXU111U odJJlvllo 


Vp1pTi-1i1<rp "nrotpin TTT T-TT ^a 


496 


31 


1572 


AAO 12605 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 26497 


941 


100 


1572 


gil5145797 


Sus scrofa 


basic proline-rich protein 


402 


32 


1 S79 


aiSQI 7666 


ZjCa iiidyo 


pyIpti cin— lilr** nmtfMn 

CA.LCX10111 lllvC UlUlClli 


375 


29 


1573 


AAG75965 


Homo sapiens 


HUMA- Human colon cancer antigen 

nrntein ^FO TD NO -67? 9 

JJlULdlX OXJ/V,/ xxJ IN \J ,KJ I 


456 


96 


1 57^ 


A A O01 560 


1 lUIIlU odLJldlo 


(^ThQ'T 1 TTnman <jpf*rptpH nrntpin SIT-i'.O 

ID NO: 5650. 


456 


96 


1 S7^ 


A AT34154Q 


JTKJlllU bdJJXClla 


T-TT TA/T A - T-Tnman panppr aQQnpiatpn* 
nrntpin semienre vSEO TD NO* 994 


456 


96 


1574 


AAB36610 


Homo sapiens 


INCY- Human FLEXHT-32 protein 

Qpnnpnpp SFD TD NO-^9 


547 


46 


1 574 


A A1VRQ1 74 


xXv/lllU odLJlCllo 


TTV9F- Human nnlvnentide SFO TD 

XX X OIj XXuXlldll L>Ul_yiJC'|JtHJ.C O-L/V^ XXV 

NO 2269. 


547 


46 


1 S74 


ai'6R41 1 76 


XXiJlIlU scipiciio 




547 


46 


1575 


AAG75449 


Homo sapiens 


HUMA- Human colon cancer antigen 

nmt<=»in 9FO TD TsJO*691 ^ 


1070 


99 


1575 


gi2982673 


Homo sapiens 


p27 


1033 


100 


1 S7S 


gll •J^ZoO'+O 


riomo sapiens 


oj ogren s oyiitironic/ si-ieroQeiiiid 

dUIOalltlgCll 1 


1033 

X \J ~J *J 


1 00 

1 uu 


1576 


gil9548926 


Homo sapiens 


rhysin 2 


2721 


99 


1576 


AAM00777 

-i iiviv/w ill 


Homo "sanipnQ 


T-TYSJF- Human hone marrow ryrotein 

XX X k./ -A— / llUlllUli L/V/llv lllUilU V Y pi vlvlllj 

SEQ ID NO: 140. 


2659 


99 


1576 


AAM00890 


Homo sapiens 


HYSE- Human bone marrow protein, 
SEQ ID NO: 366. 


1134 


100 


1577 


ABB50256 


Homo sapiens 


INCY- Human transcription factor 
TRFX-107. 


2728 


99 


1577 


gi20987263 


Homo sapiens 


zinc finger protein 289, ID1 regulated 


2636 


97 


1577 


AAB95129 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO:17133. 


2625 


96 
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1 J / o 


rrtQ£999 1 Q 
gli/OZZZ li7 


1x3. ITU S 

norvegicus 


— ; — — — ; 

beta-catenin binding protein 


2550 


95 


1 57P. 
ij/o 


A AA/T7/a9^C 
/\/\J.Vl /OZjo 


xiomo sapiens 


MOLb- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 36564. 


560 


100 


1 -> / o 


rV/\iVl O 3 H H 3 


riomo sapiens 


MOLE- Human brain expressed single 
exon probe encoaecl protein bxiQ ID 


c y f\ 

560 


100 


1 579 


glJOUOOV/A 




czy 


1 /U4 


73 


1579 


gil2642308 


Ovis aries 


type I keratin intermediate filament 
IRSal 


1671 


72 


1579 


AAB34359 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded Dy gene o ors^ ID 
NO: 120. 


1228 


69 


1 5S0 


AA J. 0_> V-/J7^/ 


Rntnn oiinipfl o 
XXUJXJAJ odXJlCIlo 


lj x im i r-oox protein rrsi^-ZD. 




1 AA 


1580 


gi6 164743 


Homo sapiens 


F-box protein Fbx20 


1002 


100 


1580 


AAY83072 


Homo sapiens 


UYNY F-box motif of FBP-25. 


261 


100 




rri9£99'3ft 1 
glZOZZjO 1 


Meinanotiiermo d 
acier 

incixTia.uioiropnic 

uo oil. XJCllci XI 


conserved protein 


107 


37 


1 5R1 


cril no ,°wS^4Q 
l7000JT , 7 


ivxeixxdiiopyrus 
kandleri AVI 9 


z^n- dependent nyciroiase 


OA 


31 


1 5R1 


<ri545£47Q 


JT y I U U U L/C lib 

abyssi 


XT x LJisXJJv I Y l^Url_,U 1 A 1 xllVJJNxi 

HYDROLASE related (EC 3.1.2.6) 

fm VDY ATA TH 


/y 


26 


158? 


eil2655474 




ivciaiiii abbociaiea protein y.o 


1 

yoi 


Q/1 

y4 


1582 


gil2655470 


Homo sapiens 


keratin associated protein 9.4 


933 


93 ' 


1582 


ail 9655466 


xxuinu btipiciib 


Keraun associated protein y.z 


oyy 




1583 


gi60 18193 


Mus musculus 


nucleotide-binding protein long form 


1391 


82 


1 58^ 


A A V777^ 


xiomo sapiens 


1JNC Y- rlxKJVL clone 1570914 protein 
sequence. 


1344 


83 


1 58^ 




uiciyosteiium 
discoideum 


xNUCieotiae-binding protein 1 


978 


60 


1 5R4 


cn90f)71 960 


ivxus muscuius 


KixvbJN cUJNA ZilUUjoxll / gene 


242 


35 


1584 


gil8252514 


Homo sapiens 


hepatocellular carcinoma-associated 
antigen xil^/\j j /d 


233 


34 


1584 


gil8252512 


Homo sapiens 


hepatocellular carcinoma-associated 
antigen huajj / a 


213 


31 


1585 


AAM00916 


Homo sapiens 


HYSE- Human bone marrow protein, 
SEQ ID NO: 392. 


1060 


40 


1 58^ 


/^^A_LV1U U o U D 


Homo sapiens 


T_T"\7"C 1 T? XT _ "1 . ■ 

HYSE- Human bone marrow protein, 
SEQ ID NO: 166. 


1060 


40 


1585 


AAM79000 


Homo sapiens 


HYSE- Human protein SEQ ID NO 
1662. 


1060 


40 


1586 


AAG03361 


Homo sapiens 


GEST Human secreted protein, SEQ 
TD NO* 7449 

XL* /H" 1 +Z. 


204 


97 


1586 


gi 1948 1932 


shrimp white 
spot syndrome 
virus 


WSSV340 


80 


41 


1586 


gil 70 16682 


shrimp white 
spot syndrome 
virus 


wsv285 


80 


41 


1587 


gi3213225 


Homo sapiens 


T-box-containing transcriptional 
activator 


952 


84 



WO 03/029271 



PCT/US02/30474 



296 
Table 2B 



SEQ 
ID 


Hit ID 


Species 


Description 


S 


Percent 

lllCllLlLj' 


1587 


eil2620817 


TVTn*? miiQmiliiQ 

J.YXLXO UlUk} v UlUO 


T-box 1 


7QQ 


57 


1587 


gil3 173432 


XVXWO XXXIXOwUXIXO 


T-nnv 1 frfi n cr'Ti'irH nn fnctrw 

x *UUA 1 IX aXXol/XXLJ IXUXX lclOlUI 


701 

/ s X 


Sfi 


1588 


ABB55706 


TTatyia cnnipnc 


x^JCfV^lTx/ XxUllldll pUiypcpilU-C OJC*V^ l-L' 

NO 18. 


i 1 09 


! OA 


1588 

X w> w U 


AAU38997 


T-Tomn <3a"niPTi<5 


f"Tr"F?"IV/T*V 1-Ti 7 m ci n of»r*i*<=>+/=»r1 Mfntpin 

V_J J— ilvx X XXUXXiCtll oCUlClCU UXL/ICXX1 

yall 1. 


1 n? 


94 


1588 


AAY17227 

x\_iX XX/ 1 


TTntnA ^arjipn^ 

JLXvlilv OClLyXl/XXO 


vjx-rivx x xxuxuaxx oc^icicu pxuicxxx ^ triune 

val-1") 

VAX X 1 ^ 


1 0? 

1 \JA 


94 


1589 


gil4577933 


Mus musculus 


ribonuclease/angiogenesis inhibitor 


258 


29 


1589 


A AB 14966 


i-Tomo ^anipTiQ 


AA/T.^If" 1 T-Ti l TYl si ti rinATinflptJOP in1ii1^iTr\T 
VV luL/ xniXXXlalX 1 lUUllUUlCcioC XXXxxXUXlUl . 


947 


90 


1589 


gil 90848 


Homo sapiens 


ribonuclease/angiogenin inhibitor 


247 


29 


1 S0O 


A AR4^4^4 


XXOxIXO odpiCllo 


rnjivx/\- xiuman cancer associateu 
protein sequence SEQ ID NO:879. 


J / O 


1 A 


1 SQO 


A A V070?Q 


norno bapicub 


luuw- jtsreast cancer associateo. 
antigen precursor sequence. 


o fo 


/4 


1 SQO 


ai 1 00914 


rxoiTio Sapiens 


aciaic riDosomai pnospnoprotein (Jr ij 


3 /o 


/4 


1591 


gi3 171877 


Homo sapiens 


dJ127D3.2 (Flavin-containing 
iviono oxygenase iamiiy pioremj 


2862 


97 


1591 


AAW49700 


Homo sapiens 


INRM Human flavin-containing mono- 
uAygeiidbe lboiorm x. 


2794 


96 


1591 


gil2006730 


Rattus 

XXUX VCglLUo 


flavin-containing monooxygenase 


2138 


72 


1592 


gi3171877 


Homo sapiens 


d J 1 27D3 .2 (Flavin-containing 
iviuiiuuxj'gcnabt; xairiiiy protein^ 


2862 


97 


1592 


AAW49700 


Homo sapiens 


INRM Human flavin-containing mono- 
oxygenase isoiorm x. 


2794 


96 


1592 


gil2006730 


Rattus 

ilAJx VCgXLUo 


flavin-containing monooxygenase 


2138 


72 


1593 


AAG74122 


Homo sapiens 


HUMA- Human colon cancer antigen 
protein SEQ ID NO:4886. 


375 


83 


1593 


gi6633816 


Arabidopsis 

olio ft r\ 

inanana 


F1N19.14 


140 


27 


i sen 


gl-70 J / JOJ 


i aKiiugu 
rubripes 


retinitis pigmentosa \j l fase regulator- 
like protein 


1 ir\ 
13U 


O 1 

zl 


1 SQ4 


A ARQ9S£Q 


riomo sapiens 


rriix^i- riuman protein sequence orsQ 
ID NO: 10777. 


1 n 1 1 


1 AA 
100 


1 SQ4 


aiS^ 1918 


T) nfhic 

ixauiis 
norvegicus 


ammo acia leature. nomeoaomam, up 
647 .. 826; amino acid feature: LIM1, 
op od .. 3ii/j ammo acid, teature. 
LIM2, bp 341 493 




96 


1594 


gil037166 


Danio rerio 


zfIsl-2 


1683 


88 


1 SOS 


A ARQ4S79 


nomo sapiens 


xiiii^i- riuman protein sequence odi^ 
ID NO: 15358. 


/3o 


1 AA 

100 


1596 


AAG02673 


Homo sapiens 


GEST Human secreted protein, SEQ 

TT* "MY"V ^^7^/1 

1LJ 1N\J. O / j4. 


359 


100 


1596 


gi9916 


Plasmodium 

Xd.XV^ipdi UX11 


liver stage antigen 


300 


20 


1596 


AAM82393 


Homo sapiens 


HUMA- Human 

immune/haematopoietic antigen SEQ 
ID NO:9986. 


223 


95 


1597 


AAM69464 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein SEQ 
ID NO: 29770. 


456 


98 


1597 


AAM57075 


Homo sapiens 


MOLE- Human brain expressed single 
exon probe encoded protein SEQ ID 


456 


98 
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NO: 29180. 






1 5Q7 
i -jy i 


gX lOU / UJOU 


jtauihu od.picnb 


r Ljjsjyj loV protein 




j 1 


1598 


AAG03347 


Homo sapiens 


GEST Human secreted protein, SEQ 

TPi "MO* 749 8 
LU i\\J. f £ rZ,o. 


219 


97 


1598 


AAB92646 


Homo sapiens 


HELI- Human protein sequence SEQ 
iu lyKj. ivy /y. 


117 


39 


1598 




T-FoTYin ennifne 
J. CsdjJlCiXo 


^ wxvx- oreasi uimour-specmc pioiein 
R305D 


1 1Z 


"2 Q 
JO 


1599 


AAM42062 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 

"MO 600^ 


346 


67 


1599 


AAB08765 


Homo sapiens 


INCY- A human leukocyte and blood 

ILXalCU piULClll yi^XDjnLjr }. 


278 


60 


1599 


A AM40276 


J. -LCllivJ GdjJlCJXo 


±± i oJ3- xiuiiian puiypepiicie od^ 1JL/ 
NO 3421. 


•9 7 A 


/ Z> 


1600 






T3 A pi Qfl 


ojUo 


1 r\n 
1UU 


1600 


2il20R3R75 


J. J.VS1J.AVJ odlJltllo 


p oiy ui umu- 1 


oZUO 


yo 


1600 


AAG78407 


Homo sapiens 


MERE Amino acid sequence of 
iiupoiy di onio i . 


8135 


96 


1601 


AAB94448 


Homo sapiens 


HELI- Human protein sequence SEQ 


2590 


99 


1601 


gi 10440369 


Homo sapiens 


FLJ000 19 protein 


2370 


99 


1 601 




/\raDiaopsis 
thaliana 


r-Dox protein iamily, Atrr>^3 


1 on 

13 / 


24 


1 60? 


en' 1 Q01 SPSS 


rxuiiiu adpiens 


A \A A P 1 






1602 


ABB05712 


Homo sapiens 


GEHU- Human testis derived protein 
cione tesj I3ni4. 


3720 


100 


1602 

J. VJ V_' 


AAM95 1 34 


AT.UI11VJ odpiCilo 


n u ivi/-\- numan leproQUCtive system 
related antigen SEQ ID NO: 3792. 


1 f\AC 




1603 


A AM95435 


JTXUIXJ.U bapiclXo 


nuivi/\- nun ia 11 reprouuctive system 
related antigen SEQ ID NO: 4093. 


/U^ 


yy 


1603 


AAG01839 


Hntnn cnTiif^YiQ 

XXU11IU odUlOllo 


vjco 1 nunidii se ere tea pro tern, oJcv^ 
ID NO: 5920. 


AAA 
****** 


o<r 

yj 


1603 


m203R095? 




Similar tr\ "RTI^ThXT ^>T\XT A 1 inPlCiCii P1Q 

oiimiar to x\j.jvjj/1> cu in j\ 1 /uuuui^-iy 
gene 


/111 


00 


1604 


gil 199604 


Homo sapiens 


zinc finger protein C2H2-25 


2055 


100 




A AA/T7R076 


riomo sapiens 


ri Y oH- rluman protein ISxiQ ID NU 
1638, 


1 /TCI 

1652 


55 


1 604 


A A1VT70060 


riomo sapiens 


11 1 ors- xiuman protem o-bQ 1JJ JNO 
3606. 


1630 


55 


1 60^ 

a OUJ 




z^ea mays 


LAA 1 -box UNA binding protein 
subunit B (NF-YB) 


72 


32 


1 OUJ 


gl| l^t-y / U 1 DO 
| emu j 
4453 11 


uanio reno 


insulin-like growth factor binding 
protein 1 ■ 


68 


33 


1606 

A \J\J\J 


A AM79754 


nomo sapiens 


n x ojj/- xiuman protein oxiv^ id in u 

3400 


1 / ^ 




1606 


AAM78770 


Homo sapiens 


HYSE- Human protein SEO ID NO 
1432. 


168 


33 


1606 


gil7226682 


Anopheles 
gambiae 


Trex 


106 


27 


1607 


gi4929701 


Homo sapiens 


CGI-1 16 protein 


755 


82 


1607 


gi7677064 


Homo sapiens 


protein x 0009 


755 


82 


1607 


gil5011990 


Homo sapiens 


Similar to CGI-1 16 protein 


750 


82 


1608 


gil 1640582 


Homo sapiens 


MSTP037 


687 


100 


1608 


gi5305335 


Mycobacterium 


proline-rich mucin homolog 


137 


39 
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Table 2B 



SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


Percent 
identity 






tuberculosis 








1608 


gi7363175 


Scmzosaccnarom 
yces pombe 


actm binding protein with SH3 
domains 


130 


33 


1609 


gi2138290 

* 


Homo sapiens 


see GenBank Accession Number 

T T/"\ 1 1 O A J? T"\"VT A - 1 1 1 „ „ 4.—. 

U01 184 for cDNA; similar to 
Drosophila melanogaster flil in 
oent>anK Accession JNumuer uui loz 
and Caenorhabditis elegans flil 
nomoiog m vjciixj diiiv /\t/L/Ci>oiuii 
Number U01183 


6615 


98 


1 60Q 


cri1 096^700 


ITHJIIUJ adpiCIlb 


illgilllCba 1 IlOIIlUlUg ^i^lUbUpillla.^ 


OO 1 J 


OS 


1609 


gi440177 


Homo sapiens 


flightless-I homolog 


6610 


98 


1 O 1 u 




nuino sapiens 


uansciipiion iacior-iiKe proLcm lvirLvjr-A. 






1610 


gi8895210 


Homo sapiens 


MSL3-2 protein 


532 


94 


1 £ 1 n 

101U 


gijyj idd d 


ivius mus cuius 


olQJbop 






1611 


gil2697482 


Homo sapiens 


dJ583Pl 5.7.2 (novel zinc finger protein 
similar to rat kj.in z>r j 


1179 


100 


1611 


gi9843768 


Homo sapiens 


dJ583P 15.7.1 (novel zinc finger protein 
similar to rat kj.in ai ) 


924 


100 


1611 


AAU16398 


Homo sapiens 


HUMA- Human novel secreted protein, 
oeq id j l. 


810 


99 


1612 


gil 8043255 


Mus musculus 


Similar to RIKEN cDNA 4733401D09 
gene 


1590 


53 


1612 


gil4714943 


Homo sapiens 


Similar to outer dense fiber of sperm 
tails 2 


698 


28 


1612 


gil7388906 


Rattus 

• 

norvegicus 


cenexin 2 


604 


27 


1613 


gil 8043255 


Mus musculus 


Similar to RIKEN cDNA 4733401D09 
gene 


1926 


64 ... 


1613 


gil 47 14943 


Homo sapiens 


Similar to outer dense fiber of sperm 
tails z 


604 


27 


1613 


gil7388906 


Rattus 
norvegicus 


cenexin 2 


508 


25 


1614 


AAM93657 


Homo sapiens 


HELI- Human polypeptide, SEQ ID 


2580 


99 


1614 


gil0177621 


Arabidopsis 
tnaiiana 


phytoene dehydrogenase-like 


1103 


49 


1614 


gil7979255 


Arabidopsis 
tnaiiana 


AT5g49550/K6M13_10 


1099 


49 


1O10 


A A T T1 ^Oni 
AAU 1 JyVD 


rlomo sapiens 


nUMA- riuman novel secretea protem, 
Seq ID 856. 


ZdZo 


OO 


101 J 




Homo sapiens 


xllil^l- Human protem sequence oiiv^) 

TT> "MO- 1 1 <J 
Wj INw.lJJlD. 


o 1 no 


1UU 


1615 


gi20 197056 


Arabidopsis 
tnaiiana 


expressed protein 


332 


28 


1010 


a a nnj.m a 


rlomo sapiens 


vjr-bo i Human secreted protem, o.bv^ 
ID NO* 8095 


Zol 




1617 


gi2 13 09945 


Mus musculus 


MRS3/4 


378 


85 


1617 


AAM41505 


Homo sapiens 


HYSE- Human polypeptide SEQ ID 
NO 6436. 


286 


88 


1617 


gil6755528 


Mus musculus 


mitochondrial carrier-like protein 


159 


54 


1619 


AAB94885 


Homo sapiens 


HELI- Human protein sequence SEQ 
ID NO: 16106. 


1454 


99 


1619 


gil 9483920 


Mus musculus 


RIKEN cDNA 2310008M20 gene 


1224 


82 


1619 


AAG01968 


Homo sapiens 


GEST Human secreted protein, SEQ 


644 


94 



